Text Mining Question Bank
Natural Language Processing
- Give 5 examples each of Holonyms, Hyponyms, Hypernyms, Metonyms, Meronyms, Homonyms, Synonyms and Polysemes.
- Draw the Venn diagram of Spellings-Meanings-Pronunciations.
- Why are Context Free Grammars context-free?
- What is the difference between RTN and ATN?
- Give examples of Prepositional Phrases.
- Compare CFG and ATN.
- Give 5 examples for Anaphora, Cataphora, Endophora, Exophora.
- Give 5 examples of NP ellipsis, VP ellipsis.
- Write a CFG and an ATN for each of the following:
  - Tech Companies queue up for Open Source Professionals.
  - I love my language.
  - Patriotism is not about watching cricket matches together.
  - AMD’s microcode is richer than Intel’s.
  - Ron Weasley should marry Hermione Granger.
  - Krishna is a metonym for uncertainty.
  - PMPO is 8 times that of RMS power measured for a 1 kHz signal with an amplitude of 1 V.
- What are the Named Entities in each of the following?
  - Open Source helps Life Spring Hospitals.
  - I want to work for Burning Glass Technologies Inc.
  - The university life at SRM is very informal.
  - AMD Phenom 5500 Black Edition can be unleashed to 4 cores.
  - Hail Hitler!
  - Anushka is taller than Surya.
- Do NP chunking on each of the following:
  - Tips and Tools for measuring the world and beating the odds
  - The crazy frog is an awesome song
  - Time flies like an arrow.
  - Thevaram was written by Appar.
  - Text mining is awfully interesting.
  - I need to get placed in a good company.
- Write a Regular Expression for replacing the beginning and end of all the lines in a text file with the strings “<BOL> ” and “<EOL> ” respectively.
- Write a regular expression for capturing Indian mobile numbers, landline numbers and Indian PIN codes with maximum possible inherent validation.
- Write a regular expression for capturing the vehicle numbers, PAN numbers and Passport numbers in a newspaper article.
- Identify rules for capturing dates and for discriminating between job dates, education dates and dates of birth.
- Give examples for Noun stemming in English & {Tamil or Telugu or Hindi} languages. Transliterate the Indian language.
- Give examples for Verb stemming in English & {Tamil or Telugu or Hindi} languages. Transliterate the Indian language.
- How does a spell checker work?
- Take some arbitrary texts and summarize them into a line or two. Justify your choice of words and sentences in your summary.
- Show some examples for word-by-word, sentence-by-sentence, context-by-context machine translation.
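As a starting point for the regular-expression questions above, here is a minimal Python sketch. It is one possible answer, not the expected one. The function name `mark_line_boundaries` and the pattern names are made up for illustration. The PIN and mobile patterns rest on assumptions: a PIN code is taken to be 6 digits with a non-zero first digit, and a mobile number is taken to be 10 digits starting with 6 to 9, optionally prefixed by +91 or 0. The sample input has no trailing newline; how a final newline should be handled is left open.

```python
import re

def mark_line_boundaries(text: str) -> str:
    """Put '<BOL> ' at the start and '<EOL> ' at the end of every line."""
    # re.MULTILINE lets ^ and $ match at each line, not only at the
    # start and end of the whole string. '.' does not match a newline,
    # so (.*) captures exactly one line.
    return re.sub(r"^(.*)$", r"<BOL> \1<EOL> ", text, flags=re.MULTILINE)

# Assumption: 6 digits, first digit 1-9, optional space after the 3rd digit.
# The lookarounds stop the pattern from firing inside a longer digit run.
PIN_RE = re.compile(r"(?<!\d)[1-9]\d{2}\s?\d{3}(?!\d)")

# Assumption: 10 digits starting with 6-9, optional '+91' or '0' prefix.
MOBILE_RE = re.compile(r"(?<!\d)(?:\+91[\s-]?|0)?[6-9]\d{9}(?!\d)")

marked = mark_line_boundaries("first line\nsecond line")
pins = PIN_RE.findall("Chennai 600042 and Kattankulathur 603 203")
mobiles = MOBILE_RE.findall("Call +91 9876543210 or 09876543210")
```

The lookarounds are the "inherent validation" part of this sketch: a 10-digit mobile number will not be mistaken for a PIN code, because a PIN match must not touch another digit on either side.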
Information Extraction & Statistical NLP
- If Prob(A) is 0.4 and Prob(B) is 0.6, what is Prob(A,B), Prob(A|B), Prob(A ∪ B), Prob(A – B), Prob(A ∩ B)? If some data is missing, assume a reasonable value for it.
- Let A be a random variable with instances a1, a2, a3, a4, a5. If P(a1) = 1.8e-4, P(a2) = 5.2e-8, P(a3) = 0.042, P(a4) = 0.00052, P(a5)=0.2, compute ∑P(A), ∏P(A) without mathematical underflow.
- Give real life examples of first-order Markov processes.
- Give real life examples of Expectation-Maximization.
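For the question above on computing ∑P(A) and ∏P(A) without underflow, one common approach is to work with logarithms: a product becomes a sum of logs, and a sum is computed with the log-sum-exp trick. The sketch below is one way to do it, assuming natural logs; the helper names `log_product` and `log_sum` are made up for illustration.

```python
import math

# Probabilities from the question.
probs = [1.8e-4, 5.2e-8, 0.042, 0.00052, 0.2]

def log_product(ps):
    """Return log(p1 * p2 * ...) as a sum of logs, avoiding tiny products."""
    return sum(math.log(p) for p in ps)

def log_sum(log_ps):
    """Return log(exp(x1) + exp(x2) + ...) by factoring out the largest term."""
    m = max(log_ps)
    return m + math.log(sum(math.exp(x - m) for x in log_ps))

log_prod = log_product(probs)                       # log of the product
log_total = log_sum([math.log(p) for p in probs])   # log of the sum

# Convert back only at the end, and only if the value is representable.
product = math.exp(log_prod)
total = math.exp(log_total)
```

With only five values the plain product would not underflow in double precision; the log form matters when hundreds or thousands of small probabilities are multiplied, as in language models or HMMs.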

Developer Camp 2010
I had asked a couple of boys in the audience to go hunting for more listeners for the talk. I had to advertise and promote my own talk, which in fact is critical for everything in the world we live in. One of the volunteers advised me to pick up a microphone and start the talk. When I started, I was surprised to see people walk in and fill up the hall. The talk went on with a lot of interesting examples, which made everyone introspect about the way we see and assess our neighbourhood. I am sure my audience now understands that everything we see around us and try to solve can be modeled mathematically and solved using computers. Hurray, we made it!!













