Normal view MARC view ISBD view

Python 3 text processing with NLTK 3 cookbook : over 80 practical recipes on natural language processing techniques using Python's NLTK 3.0 / Jacob Perkins ; cover image by Faiz Fattohi.

By: Perkins, Jacob [author.].

Contributor(s): Fattohi, Faiz [cover designer.].

Material type: Text

TextPublisher: Birmingham, England : Packt Publishing Ltd, 2014Copyright date: ©2014Description: 1 online resource (304 pages) : Rs. 849.00 illustrations.Content type: text Media type: computer Carrier type: online resourceISBN: 9781782167860; 1782167862; 1782167854; 9781782167853.Subject(s): COMPUTERS -- Natural Language Processing | Python (Computer program language) -- Juvenile literature | Natural language processing (Computer science) -- Research | COMPUTERS -- Programming Languages -- PythonGenre/Form: Electronic books.DDC classification: 005.133

Contents:

Cover; Copyright; Credits; About the Author; About the Reviewers; www.PacktPub.com; Table of Contents; Preface; Chapter 1: Tokenizing Text and WordNet Basics; Introduction; Tokenizing text into sentences; Tokenizing sentences into words; Tokenizing sentences using regular expressions; Training a sentence tokenizer; Filtering stopwords in a tokenized sentence; Looking up Synsets for a word in WordNet; Looking up lemmas and synonyms in WordNet; Calculating WordNet Synset similarity; Discovering word collocations; Chapter 2: Replacing and Correcting Words; Introduction; Stemming words.

Lemmatizing words with WordNetReplacing words matching regular expressions; Removing repeating characters; Spelling correction with Enchant; Replacing synonyms; Replacing negations with antonyms; Chapter 3: Creating Custom Corpora; Introduction; Setting up a custom corpus; Creating a wordlist corpus; Creating a part-of-speech tagged word corpus; Creating a chunked phrase corpus; Creating a categorized text corpus; Creating a categorized chunk corpus reader; Lazy corpus loading; Creating a custom corpus view; Creating a MongoDB-backed corpus reader; Corpus editing with file locking.

Chapter 4: Part-of-speech TaggingIntroduction; Default tagging; Training a unigram part-of-speech tagger; Combining taggers with backoff tagging; Training and combining ngram taggers; Creating a model of likely word tags; Tagging with regular expressions; Affix tagging; Training a Brill tagger; Training the TnT tagger; Using WordNet for tagging; Tagging proper names; Classifier-based tagging; Training a tagger with NLTK-Trainer; Chapter 5: Extracting Chunks; Introduction; Chunking and chinking with regular expressions; Merging and splitting chunks with regular expressions.

Expanding and removing chunks with regular expressionsPartial parsing with regular expressions; Training a tagger-based chunker; Classification-based chunking; Extracting named entities; Extracting proper noun chunks; Extracting location chunks; Training a named entity chunker; Training a chunker with NLTK-Trainer; Chapter 6: Transforming Chunks and Trees; Introduction; Filtering insignificant words from a sentence; Correcting verb forms; Swapping verb phrases; Swapping noun cardinals; Swapping infinitive phrases; Singularizing plural nouns; Chaining chunk transformations.

Converting a chunk tree to textFlattening a deep tree; Creating a shallow tree; Converting tree labels; Chapter 7: Text Classification; Introduction; Bag of words feature extraction; Training a Naive Bayes classifier; Training a decision tree classifier; Training a maximum entropy classifier; Training scikit-learn classifiers; Measuring precision and recall of a classifier; Calculating high information words; Combining classifiers with voting; Classifying with multiple binary classifiers; Training a classifier with NLTK-Trainer; Chapter 8: Distributed Processing and Handling Large Datasets.

List(s) this item appears in: 2018-03-28

Holdings ( 1 )
Title notes

Item type	Current location	Call number	Status	Date due	Barcode	Item holds
Book	Chennai Mathematical Institute General Stacks	005.133 PER (Browse shelf)	Available		10347

Total holds: 0

Browsing Chennai Mathematical Institute Shelves , Shelving location: General Stacks Close shelf browser

Previous								Next
Previous	005.114 BIR Thinking functionally with Haskell /	005.13 ASP The optimal implementation of functional programming languages /	005.133 CHO Deep learning with Python /	005.133 PER Python 3 text processing with NLTK 3 cookbook : over 80 practical recipes on natural language processing techniques using Python's NLTK 3.0 /	005.133 WAL Java : the good parts /	005.133 WEL Essentials of programming in Mathematica /	005.14 CLA Model checking /	Next

"Quick answers to common problems"--Cover.

Includes index.

Online resource; title from PDF title page (ebrary, viewed September 2, 2014).

CMI Library

Python 3 text processing with NLTK 3 cookbook : over 80 practical recipes on natural language processing techniques using Python's NLTK 3.0 / Jacob Perkins ; cover image by Faiz Fattohi.

By: Perkins, Jacob [author.].

Contributor(s): Fattohi, Faiz [cover designer.].

Browsing Chennai Mathematical Institute Shelves , Shelving location: General Stacks Close shelf browser