AM
BUAL5660 FINAL EXAM QUESTIONS AND ANSWERS
WITH COMPLETE SOLUTIONS VERIFIED
Leave the first rating
Save
What must be done to unstructured We must impose a structure ondata to convert it into a structured form
data in order to complete analysis?
Stop Words Those words that which we do NOT want in our analysis
Bag of Words Another form of Term by Document Matrix
Higher TFIDF More frequency in one document, but less across multiple documents
Support How many times all the products in a rule are brought together
What is a hidden neruon? A linear combination of neurons in the previous layer
Filter A metric containing some weights
Keras package Used in Python for deep learning and convolutional neural network
To control overfitting,_______is an Dropout
important approach in neural
networks
An application-programming interface (API) is a set of programming
API
instructions and standards for accessing a Web-based software
application or Web tool.
To find information on the hundreds of millions of Web pages that
Web Crawling exist, a search engine employs special software robots, called spiders,
to build lists of the words found on Websites
Text Analytics information retrieval + information extraction + data mining + web mining
A semi-automated process of extracting knowledge from unstructured
Text Mining
data sources
The nature of the data:
Differences between text mining and - Structured vs. unstructured data
data mining - Structured: in databases
- Unstructured: Word documents, PDF files, etc
1. Establish the Corpus
Text Mining Process 2.Create the Term-Document Matrix
3.Extract Knowledge
Tokenization The process of breaking up a given text into units called tokens
Remove inflectional endings only and to return the base or dictionary
Lemmatization
form of the word
Sentiment Belief, view, opinion, and conviction
Sentiment is trying to answer what "What do people feel about a certain topic?"
question?
1. Sentiment detection
2.N-P Polarity classification
Sentiment Analysis Process
3.Target identification
4. Collection and aggregation
Finds subsets that are common to at least a minimum number of the itemsets
Apriori Algortihm
- Uses a bottom-up approach
1/
3