BookRags.com Literature Guides Literature
Guides
Criticism & Essays Criticism &
Essays
Questions & Answers Questions &
Answers
Lesson Plans Lesson
Plans
My Bibliography Periodic Table U.S. Presidents Shakespeare Sonnet Shake-Up
Research Anything:        
History | Encyclopedias | Films | News | Create a Bibliography | More... Login | Register | Help
Not What You Meant?  There are 15 definitions for Brill.

Brill tagger

Print-Friendly
About 1 pages (283 words)

Bookmark and Share Know this topic well? Help others and get FREE products!

The Brill tagger is a method for doing part-of-speech tagging. It was described by Eric Brill in his 1993 PhD thesis [1]. It can be summarized as an "error-driven transformation-based tagger". It is

  • error-driven in the sense that it recourses to supervised learning
  • transformation-based in the sense that a tag is assigned to each word and changed using a set of predefined rules. Note: If the word is known, it first assigns the most frequent tag, or if the word is unknown, it naively assigns the tag "noun" to it. Applying over and over these rules, changing the incorrect tags, a quite high accuracy is achieved.

Algorithm

The algorithm goes as follows:

  • Initialisation:
    • Known words (in vocabulary): assigning the most frequent tag associated to a form of the word
    • Unknown words (out of vocabulary) :
      • Proper noun if capitalised and simple noun else (1992)
      • Learning or guessing rules on the same basis as contextual rules (1994)
  • Learning Phase
    • Iteratively compute the error score of each candidate rule (difference between the number of errors before and after applying the rule)
    • Select the best (higher score) rule.
    • Add it to the rule set and apply it to the text.
    • Repeat until no rule has a score above a given threshold (that is, until applying new rules leaves the text in the same state, which is then supposed to be the final state of the tagging).

Rules

Lexical rules are used for the initialisation, and contextual rules are used to correct the tags.

  • Lexical rules: wordtag IF Condition (example: identification of suffixes like "-tion")
  • Contextual rules: tag1tag2 IF Condition (example: "preceding/following tag is X", "preceding/following word is w")

View More Summaries on Brill tagger
 
Ask any question on Brill tagger and get it answered FAST!
Answer questions in BookRags Q&A and earn points toward
discounted or even FREE Study Guides and other BookRags products!
Learn more about BookRags Q&A
Copyrights
Brill tagger from Wíkipedia. ©2006 by Wíkipedia. Licensed under the GNU Free Documentation License. View a list of authors or edit this article.

Article Navigation
Join BookRagslearn moreJoin BookRags




About BookRags | Customer Service | Report an Error | Terms of Use | Privacy Policy