What is the process of converting text data into a numerical format for use in machine learning algorithms called?
A. Data Encoding
B. Data Tokenization
C. Data Parsing
D. Data Transformation
Which data collection method involves selecting a sample from a larger population and collecting data from the sample only?
A. Experiments
B. Observational Studies
C. Surveys
D. Sampling
In data preprocessing, what is the purpose of data scaling or normalization?
A. To increase data complexity
B. To standardize data
C. To remove outliers
D. To create new features
Which data collection method involves manipulating one or more variables to observe their effects on other variables?
A. Observations
B. Surveys
C. Experiments
D. Interviews
Which of the following is a common technique for handling outliers in a dataset?
A. Data Imputation
B. Data Transformation
C. Data Visualization
D. Data Scaling
Which data collection method involves asking respondents a standardized set of questions, typically through a questionnaire?
A. Observations
B. Experiments
C. Surveys
D. Interviews
What is the process of transforming categorical data into numerical values called?
A. Data Encoding
B. Data Imputation
C. Data Transformation
D. Data Standardization
Which of the following is NOT a common step in data preprocessing?
A. Data Cleaning
B. Data Visualization
C. Data Transformation
D. Data Integration
In data preprocessing, what is the term for identifying and removing duplicate or redundant data?
A. Data Deduplication
B. Data Aggregation
C. Data Normalization
D. Data Imputation
Which of the following is a common technique for handling imbalanced data in classification problems?
A. Data Augmentation
B. Data Scaling
C. Data Encoding
D. Data Imputation