Classes | |
| class | ContextReader |
| Class for reading the left and right context of a token. | |
| class | CorpusManager |
| Class for managing corpus repositories. | |
| class | IndexManager |
| Class for indexing and removing documents from the index. | |
| class | SearchEngine |
| Class that provides functionality for querying a corpus. | |
| class | StatisticsEngine |
| Class that provides functionality for statistical calculations on a corpus. | |
| class | Token |
| Virtual base class for all Token descendants. | |
| class | EmptyToken |
| A Token with neither a token string nor any occurrences. | |
| class | CorpusTokenBase |
| Base class for all CorpusToken classes. | |
| class | CorpusToken |
| A Token consisting of a single word. | |
| class | LightCorpusToken |
| A corpus token that carries only corpus frequency information (as opposed to CorpusToken). | |
| class | MutableToken |
| A Token whose OccurrenceMap can be modified. MutableToken mainly serves as a base class for phrases, whose occurrence map needs to be modified because phrases themselves are not indexed. | |
| class | Phrase |
| A phrasal token consisting of several words or tokens. | |
| class | TokenContext |
| Class that holds one specific context of a Token. | |
| class | Input |
| class | StringInput |
| class | FileInput |
| class | Tokenizer |
| class | TextSaver |
Namespaces | |
| namespace | Exceptions |
Typedefs | |
| typedef uint32_t | DocID |
| The ID of a document in the database. | |
| typedef uint32_t | TokenID |
| The ID of a Token in the database. | |
| typedef uint32_t | IdxPos |
| The byte offset of a Token in a document. | |
| typedef std::vector< IdxPos > | PositionList |
| The positions of a Token in a document. | |
| typedef std::map< DocID, PositionList > | OccurrenceMap |
| Documents and positions for a Token. | |
| typedef boost::shared_ptr< Token > | TokenPtr |
| Pointer to a Token object. | |
| typedef boost::shared_ptr< TokenContext > | TokenContextPtr |
| Pointer to a TokenContext object. | |
| typedef std::vector< TokenPtr > | TokenVector |
| A vector of TokenPtr. | |
| typedef std::tr1::unordered_map< std::string, std::pair< TokenID, PositionList > > | idx_map |
Variables | |
| static const TokenID | InvalidTokenID = 0 |
| static const DocID | InvalidDocID = 0 |
Typedef Documentation
| typedef uint32_t PhraseHunter::DocID |
| typedef std::tr1::unordered_map<std::string, std::pair<TokenID, PositionList> > PhraseHunter::idx_map |
Definition at line 43 of file indexer.cpp.
| typedef uint32_t PhraseHunter::IdxPos |
| typedef std::map<DocID, PositionList> PhraseHunter::OccurrenceMap |
| typedef std::vector<IdxPos> PhraseHunter::PositionList |
| typedef boost::shared_ptr<TokenContext> PhraseHunter::TokenContextPtr |
| typedef uint32_t PhraseHunter::TokenID |
| typedef boost::shared_ptr<Token> PhraseHunter::TokenPtr |
| typedef std::vector<TokenPtr> PhraseHunter::TokenVector |
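The typedefs above nest into one another: an OccurrenceMap maps each DocID to a PositionList, which in turn holds the byte offsets (IdxPos) of a Token in that document. The following is a minimal sketch of that layout, not code taken from PhraseHunter. The typedefs are re-declared locally so the example compiles on its own, and the helpers `addOccurrence` and `corpusFrequency` are hypothetical names introduced only for illustration.

```cpp
#include <cstddef>
#include <map>
#include <stdint.h>
#include <vector>

// Local re-declarations mirroring the typedefs listed above. In the real
// code base these come from the PhraseHunter headers.
typedef uint32_t DocID;
typedef uint32_t IdxPos;
typedef std::vector<IdxPos> PositionList;
typedef std::map<DocID, PositionList> OccurrenceMap;

// Hypothetical helper: record one occurrence at byte offset `pos`
// in document `doc`. operator[] creates an empty PositionList on first use.
inline void addOccurrence(OccurrenceMap& occ, DocID doc, IdxPos pos) {
    occ[doc].push_back(pos);
}

// Hypothetical helper: total number of occurrences across all documents.
inline std::size_t corpusFrequency(const OccurrenceMap& occ) {
    std::size_t total = 0;
    for (OccurrenceMap::const_iterator it = occ.begin(); it != occ.end(); ++it) {
        total += it->second.size();
    }
    return total;
}
```

With this layout, `occ.size()` gives the number of documents that contain the token, while summing the PositionList sizes gives its frequency over the whole corpus.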
Variable Documentation
const DocID PhraseHunter::InvalidDocID = 0 [static] |
Definition at line 68 of file ph_types.h.
const TokenID PhraseHunter::InvalidTokenID = 0 [static] |
Definition at line 67 of file ph_types.h.
Referenced by PhraseHunter::Token::id(), and PhraseHunter::CorpusToken::loadFromCorpus().
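Because InvalidTokenID is 0, a lookup can return it to signal "no such token", provided real IDs never take the value 0. The sketch below illustrates this with a map shaped like idx_map. It is an assumption-laden example, not the logic of indexer.cpp: `recordToken` and `lookupTokenID` are hypothetical helpers, IDs are assumed to be handed out starting at 1, and `std::unordered_map` stands in for `std::tr1::unordered_map`.

```cpp
#include <cassert>
#include <stdint.h>
#include <string>
#include <unordered_map>
#include <utility>
#include <vector>

// Local re-declarations mirroring the listing above.
typedef uint32_t TokenID;
typedef uint32_t IdxPos;
typedef std::vector<IdxPos> PositionList;
// Assumption: std::unordered_map replaces std::tr1::unordered_map here.
typedef std::unordered_map<std::string, std::pair<TokenID, PositionList> > idx_map;

static const TokenID InvalidTokenID = 0;

// Hypothetical helper: record `word` at byte offset `pos`. A word seen for
// the first time receives the next free ID; `nextId` is assumed to start at 1
// so that 0 stays reserved for InvalidTokenID.
inline TokenID recordToken(idx_map& idx, TokenID& nextId,
                           const std::string& word, IdxPos pos) {
    assert(nextId != InvalidTokenID);
    idx_map::iterator it = idx.find(word);
    if (it == idx.end()) {
        it = idx.insert(idx_map::value_type(
                 word, std::make_pair(nextId, PositionList()))).first;
        ++nextId;
    }
    it->second.second.push_back(pos);
    return it->second.first;
}

// Hypothetical helper: return the ID of `word`, or InvalidTokenID if the
// word has not been recorded.
inline TokenID lookupTokenID(const idx_map& idx, const std::string& word) {
    idx_map::const_iterator it = idx.find(word);
    return it == idx.end() ? InvalidTokenID : it->second.first;
}
```

Callers can then compare the result against InvalidTokenID instead of relying on exceptions or a separate found flag.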