Lecture 2: Lexical association measures and hypothesis testing

Pre-lecture Readings. Lexical association Named entities: http://www.nltk.org/book/ch07.html Information extraction architecture raw text->sentence segmentation->takenization0<part of speech tagging->entity detection->relation detection chunking: segments and labels multi-token sequences as illustrated in 2.1. Noun-phrase (NP) chunking tag patterns: describe sequences of tagged words Chunking with Regular Expressions Exploring Text Corpora Chinking: define a chink to be a sequence of tokens that is not included in … Read moreLecture 2: Lexical association measures and hypothesis testing

Leave a Reply

Your email address will not be published. Required fields are marked *