PythonTutorials.net
Toggle Menu
Home
Online Python Compiler
Tutorials
Django
Flask
Scikit-Learn
NumPy
NLTK
Pillow
Blog
All Posts
Natural Language Processing with NLTK
Evaluate your skills in tokenization,stemming,tagging,and text classification using NLTK.
1. What is the primary function of nltk.word_tokenize()?
Splits text into sentences
Splits text into individual words/tokens
Performs part-of-speech tagging
2. Which NLTK module provides access to stopwords?
nltk.corpus.stopwords
nltk.tokenize
nltk.stem
3. What does POS stand for in the context of NLTK?
Primary Object Syntax
Part Of Speech
Paragraph Organization Structure
4. Which NLTK stemmer is known for its aggressive stemming approach?
PorterStemmer
LancasterStemmer
SnowballStemmer
5. What is the output format of nltk.pos_tag(tokens)?
A list of words
A list of (word, POS tag) tuples
A single string of tags
6. Which NLTK resource is a lexical database of English words and their semantic relationships?
Brown Corpus
WordNet
Reuters Corpus
7. What task does the nltk.sent_tokenize() function perform?
Word tokenization
Sentence segmentation
Stemming
8. Which function is used to download NLTK resources (e.g., corpora, models)?
nltk.install()
nltk.download()
nltk.get()
9. Select all NLTK stemming algorithms from the options below:
PorterStemmer
WordNetLemmatizer
LancasterStemmer
SnowballStemmer
10. Which of the following are NLTK corpora (text collections)?
Brown Corpus
Reuters Corpus
WordNet
Gutenberg Corpus
11. Which part-of-speech tags represent verb forms in NLTK's Penn Treebank tagset?
VB
NN
VBD
JJ
12. Select tasks that NLTK is commonly used for:
Tokenization
Machine Translation
Part-of-Speech Tagging
Lemmatization
13. Which of these are components of the NLTK library?
Tokenizers
Neural Network Layers
Parsers
Corpora
14. NLTK is an open-source Python library for Natural Language Processing.
True
False
15. Lemmatization in NLTK always requires specifying a part-of-speech tag to produce accurate results.
True
False
16. The nltk.corpus.stopwords.words('english') function returns a list of common English stopwords.
True
False
17. NLTK's word_tokenize() function can tokenize text in any language without additional resources.
True
False
18. What does the acronym 'NLTK' stand for?
19. Name the NLTK class used for lemmatization (provide the full class name, including module if necessary).
20. What is the standard abbreviation for the part-of-speech tag that represents a plural noun in NLTK's Penn Treebank tagset?
Reset
Answered 0 of 0 — 0 correct