Some parts of this website may do not work correctly, because your browser doesn't support JavaScript or you have disabled it. In order to use all features please enable JavaScript in your browser.

Specification for annotator > parser > gobio

gobio

A deep parser based on the parser used in Translatica machine translation system.

Gobio operates on morfologically annotated text.

Gobio has some predefined sets of rules for several languages, but if you want, you can provide your own rule file with --rules option. The rules for gobio are in general a kind of context-free grammar rules. The tutorial on how to write your own rule sets for gobio is being prepared.

Explanation of symbols used to denote grammatical categories for Polish:

  • C (czasownik) - verb
  • LG (liczebnik główny) - cardinal numeral
  • P (przymiotnik) - adjective
  • PR (przyimek) - preposition
  • PS (przysłówek) - adverb
  • R (rzeczownik) - noun
  • S (spójnik) - conjunction
  • ZP (zaimek przymiotny) - adjective pronoun
  • ZRn (zaimek rzeczowny nieokreślony) - indefinite noun pronoun
  • ZRo (zaimek rzeczowny osobowy) - personal noun pronoun
  • ZRs (zaimek „się”) - pronoun “się”
  • ZRw (zaimek wskazujący) - demonstrative pronoun
  • ZS (zaimek przysłowny) - adverbial pronoun

Aliases

parse, parse-generator, parser

Languages

de, pl, test

Examples

--line-by-line gobio --lang pl --terminal-tag parse-terminal ! bracketing-writer --disamb --tags parse --opening-bracket %c[

Parse Polish sentences line by line and print simplified constituent tree for each sentence.

in:
Komputer czyta zdania.
Każde zdanie ma swoje drzewo składniowe.
Zrobiłem już trzy zdania.
out:
FR[R[Komputer]] fin[czyta] FR[R[zdania]].
FR[ZP[Każde] R[zdanie]] FP[P[ma]] FR[ZP[swoje] R[drzewo] FP[P[składniowe]]].
praet[Zrobiłem] FPS[PS[już]] FR[LG[trzy] R[zdania]].

Options

Allowed options:
  --lang arg (=guess)                   language
  --force-language                      force using specified language even if 
                                        a text was resognised otherwise
  --edge-number-limit arg (=-1)         maximal number of edges inserted 
                                        between each two vertices
  --rules arg (=%ITSDATA%/%LANG%/rules.g)
                                        file with rules in text format
  --terminal-tag arg (=parse-terminal)  tag for terminal

Other help resources