Frog
|
a simple datastructure to hold all frogged information of one word More...
#include <FrogData.h>
Public Member Functions | |
frog_record () | |
default constructor More... | |
~frog_record () | |
default destructor More... | |
nlohmann::json | to_json () const |
Public Attributes | |
std::string | word |
the word in UTF8 More... | |
std::string | clean_word |
lowercased word (MBMA only) More... | |
std::string | token_class |
the assigned token class of the word More... | |
std::string | language |
the deteected language of the word More... | |
bool | no_space |
was there a space after the word? More... | |
bool | new_paragraph |
did the tokenizer detect a paragraph here? More... | |
std::string | tag |
the assigned POS tag More... | |
double | tag_confidence |
the confidence of the POS tag More... | |
std::string | iob_tag |
the assigned IOB tag More... | |
double | iob_confidence |
the confidence of the IOB tag More... | |
std::string | ner_tag |
the assigned NER tag More... | |
double | ner_confidence |
the confidence of the NER tag More... | |
std::vector< std::string > | lemmas |
a list of possible lemma's More... | |
std::vector< std::vector< std::string > > | morphs |
the morph analysis More... | |
std::vector< const BaseBracket * > | deep_morphs |
pointers to the deep morphemes More... | |
std::string | compound_string |
string representation of first compound More... | |
std::string | morph_string |
string representation of first morph More... | |
std::string | deep_morph_string |
string representation of first deep_morph More... | |
int | parse_index |
label of the dependency More... | |
std::string | parse_role |
role of the dependency More... | |
std::set< size_t > | parts |
set of indeces a MWU is made of (MWU only) More... | |
a simple datastructure to hold all frogged information of one word
frog_record::frog_record | ( | ) |
default constructor
frog_record::~frog_record | ( | ) |
default destructor
json frog_record::to_json | ( | ) | const |
format a frog_record fd into a json structure
std::string frog_record::clean_word |
lowercased word (MBMA only)
std::string frog_record::compound_string |
string representation of first compound
std::string frog_record::deep_morph_string |
string representation of first deep_morph
std::vector<const BaseBracket*> frog_record::deep_morphs |
pointers to the deep morphemes
double frog_record::iob_confidence |
the confidence of the IOB tag
std::string frog_record::iob_tag |
the assigned IOB tag
std::string frog_record::language |
the deteected language of the word
std::vector<std::string> frog_record::lemmas |
a list of possible lemma's
std::string frog_record::morph_string |
string representation of first morph
std::vector<std::vector<std::string> > frog_record::morphs |
the morph analysis
double frog_record::ner_confidence |
the confidence of the NER tag
std::string frog_record::ner_tag |
the assigned NER tag
bool frog_record::new_paragraph |
did the tokenizer detect a paragraph here?
bool frog_record::no_space |
was there a space after the word?
int frog_record::parse_index |
label of the dependency
std::string frog_record::parse_role |
role of the dependency
std::set<size_t> frog_record::parts |
set of indeces a MWU is made of (MWU only)
std::string frog_record::tag |
the assigned POS tag
double frog_record::tag_confidence |
the confidence of the POS tag
std::string frog_record::token_class |
the assigned token class of the word
std::string frog_record::word |
the word in UTF8