a simple datastructure to hold all frogged information of one word More...

#include <FrogData.h>

Collaboration diagram for frog_record:

[legend]

Public Member Functions
	frog_record ()
	default constructor More...

	~frog_record ()
	default destructor More...

nlohmann::json	to_json () const

Public Attributes
std::string	word
	the word in UTF8 More...

std::string	clean_word
	lowercased word (MBMA only) More...

std::string	token_class
	the assigned token class of the word More...

std::string	language
	the deteected language of the word More...

bool	no_space
	was there a space after the word? More...

bool	new_paragraph
	did the tokenizer detect a paragraph here? More...

std::string	tag
	the assigned POS tag More...

double	tag_confidence
	the confidence of the POS tag More...

std::string	iob_tag
	the assigned IOB tag More...

double	iob_confidence
	the confidence of the IOB tag More...

std::string	ner_tag
	the assigned NER tag More...

double	ner_confidence
	the confidence of the NER tag More...

std::vector< std::string >	lemmas
	a list of possible lemma's More...

std::vector< std::vector< std::string > >	morphs
	the morph analysis More...

std::vector< const BaseBracket * >	deep_morphs
	pointers to the deep morphemes More...

std::string	compound_string
	string representation of first compound More...

std::string	morph_string
	string representation of first morph More...

std::string	deep_morph_string
	string representation of first deep_morph More...

int	parse_index
	label of the dependency More...

std::string	parse_role
	role of the dependency More...

std::set< size_t >	parts
	set of indeces a MWU is made of (MWU only) More...

Detailed Description

a simple datastructure to hold all frogged information of one word

Constructor & Destructor Documentation

◆ frog_record()

frog_record::frog_record ( )

default constructor

◆ ~frog_record()

frog_record::~frog_record ( )

default destructor

Member Function Documentation

◆ to_json()

json frog_record::to_json ( ) const

format a frog_record fd into a json structure

Returns: an JSON structure

Member Data Documentation

◆ clean_word

std::string frog_record::clean_word

lowercased word (MBMA only)

◆ compound_string

std::string frog_record::compound_string

string representation of first compound

◆ deep_morph_string

std::string frog_record::deep_morph_string

string representation of first deep_morph

◆ deep_morphs

std::vector<const BaseBracket*> frog_record::deep_morphs

pointers to the deep morphemes

◆ iob_confidence

double frog_record::iob_confidence

the confidence of the IOB tag

◆ iob_tag

std::string frog_record::iob_tag

the assigned IOB tag

◆ language

std::string frog_record::language

the deteected language of the word

◆ lemmas

std::vector<std::string> frog_record::lemmas

a list of possible lemma's

◆ morph_string

std::string frog_record::morph_string

string representation of first morph

◆ morphs

std::vector<std::vector<std::string> > frog_record::morphs

the morph analysis

◆ ner_confidence

double frog_record::ner_confidence

the confidence of the NER tag

◆ ner_tag

std::string frog_record::ner_tag

the assigned NER tag

◆ new_paragraph

bool frog_record::new_paragraph

did the tokenizer detect a paragraph here?

◆ no_space

bool frog_record::no_space

was there a space after the word?

◆ parse_index

int frog_record::parse_index

label of the dependency

◆ parse_role

std::string frog_record::parse_role

role of the dependency

◆ parts

std::set<size_t> frog_record::parts

set of indeces a MWU is made of (MWU only)

◆ tag

std::string frog_record::tag

the assigned POS tag

◆ tag_confidence

double frog_record::tag_confidence

the confidence of the POS tag

◆ token_class

std::string frog_record::token_class

the assigned token class of the word

◆ word

std::string frog_record::word

the word in UTF8

The documentation for this class was generated from the following files:

include/frog/FrogData.h
src/FrogData.cxx

Public Member Functions

Public Attributes