8000 Index datastructure map of docid -> docno · Issue #25 · ten-blue-links/fxt · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
Index datastructure map of docid -> docno #25
Open
@lgrz

Description

@lgrz

The feature extraction program extract_features is bound to calling on Indri's metadata. This could be handled by a (docid, docno) map file that is created by Tesserae at index time.

The following could be replaced with a map lookup:

std::vector<docid_t> docids = qry_env.document_ids_from_metadata("docno", docnos);

This also has the benefit of reducing the dependency on Indri to the programs that absolutely require it (i.e the indexing programs).

Related to #13 improving index component.

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions

      0