1.
multiclass classification diagram
2.
linear regression and scatter plots
3.
pivot table
4.
K-means cluster diagram
Q 1 / 92
1.
The product would look for new patterns in spam messages.
2.
The product could go through the keyword list much more quickly.
3.
The product could have a much longer keyword list.
4.
The product could find spam messages using far fewer keywords.
Q 2 / 92
1.
data cluster
2.
supervised set
3.
big data
4.
test data
Q 3 / 92
1.
patterns
2.
programs
3.
rules
4.
data
Q 4 / 92
1.
It was consistently wrong.
2.
It was inconsistently wrong.
3.
It was consistently right.
4.
It was equally right and wrong.
Q 5 / 92
1.
Find labeled data of sunny days so that the machine will learn to identify bad weather.
2.
Use unsupervised learning to have the machine look for anomalies in a massive weather database.
3.
Create a training set of unusual patterns and ask the machine learning algorithms to classify them.
4.
Create a training set of normal weather and have the machine look for similar patterns.
Q 6 / 92
1.
regression
2.
boosting
3.
bagging
4.
stacking
Q 7 / 92
1.
Regression analysis
2.
K-means clustering
3.
Big data
4.
Unsupervised learning
Q 8 / 92
1.
a data entry system
2.
a data warehouse system
3.
a massive data repository
4.
a product recommendation system
Q 9 / 92
![Machine Learning Q10](images/machine-learning_Q10.jpg)
1.
a decision tree
2.
reinforcement learning
3.
K-nearest neighbor
4.
a clear trendline
Q 10 / 92
1.
The algorithms would help the meters access the internet.
2.
The algorithms would improve the wireless connectivity.
3.
The algorithms would help your organization see patterns in the data.
4.
By using machine learning algorithms, you are creating an IoT device.
Q 11 / 92
1.
regression
2.
clustering
3.
classification
4.
dimensionality reduction
Q 12 / 92
1.
It naively assumes that you will have no data.
2.
It does not even try to create accurate predictions.
3.
It naively assumes that the predictors are independent from one another.
4.
It naively assumes that all the predictors depend on one another.
Q 13 / 92
![Machine Learning Q14](images/machine-learning_Q14.jpg)
1.
It is a linear regression chart.
2.
It is a supervised trendline chart.
3.
It is a decision tree.
4.
It is a clustering trend chart.
Q 14 / 92
1.
Artificial intelligence focuses on classification, while machine learning is about clustering data.
2.
Machine learning is a type of artificial intelligence that relies on learning through data.
3.
Artificial intelligence is a form of unsupervised machine learning.
4.
Machine learning and artificial intelligence are the same thing.
Q 15 / 92
1.
The algorithms typically run on more powerful servers.
2.
The algorithms are better at seeing patterns in the data.
3.
Machine learning servers can host larger databases.
4.
The algorithms can run on unstructured data.
Q 16 / 92
1.
Create an artificial neural network that would host the company directory.
2.
Use machine learning to better predict risk.
3.
Create an algorithm that consolidates all of your Excel spreadsheets into one data lake.
4.
Use machine learning and big data to research salary requirements.
Q 17 / 92
![Machine Learning Q18](images/machine-learning_Q18.jpg)
1.
Training Set
2.
Unsupervised Data
3.
Supervised Learning
4.
Binary Classification
Q 18 / 92
1.
You will almost certainly underfit the model.
2.
You will pick the wrong algorithm.
3.
You might not have enough data for both.
4.
You will almost certainly overfit the model.
Q 19 / 92
**Explanation**: While machine learning algorithms themselves don't have bias, the data they are trained on can.
1.
Machine learning algorithms are based on math and statistics, and so by definition will be unbiased.
2.
There is no way to identify bias in the data.
3.
Machine learning algorithms are powerful enough to eliminate bias from the data.
4.
All human-created data is biased, and data scientists need to account for that.
Q 20 / 92
1.
The predictions of one model become the inputs of another.
2.
You use different versions of machine learning algorithms.
3.
You use several machine learning algorithms to boost your results.
4.
You stack your training set and testing set together.
Q 21 / 92
1.
training data
2.
linear regression
3.
big data
4.
test data
Q 22 / 92
**Explanation**: The problem explicitly states "clustering".
1.
centroid reinforcement
2.
K-nearest neighbor
3.
binary classification
4.
K-means clustering
Q 23 / 92
1.
Include training email data from all employees.
2.
Include training email data from new employees.
3.
Include training email data from seasoned employees.
4.
Include training email data from employees who write the majority of internal emails.
Q 24 / 92
1.
unsupervised machine learning
2.
binary classification
3.
supervised machine learning
4.
reinforcement learning
Q 25 / 92
![Machine Learning Q26](images/machine-learning_Q26.jpg) Note: the chart shows the centers of the clusters (C0, C1, C2).
1.
K-nearest neighbor
2.
a decision tree
3.
a linear regression
4.
a K-means cluster
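The note above points out the cluster centers C0, C1, and C2. A minimal K-means sketch (synthetic 2-D points, scikit-learn assumed available) that exposes those centroids:

```python
# Minimal K-means sketch: fit 3 clusters and inspect the centroids (C0, C1, C2).
# The data here is synthetic, purely for illustration.
import numpy as np
from sklearn.cluster import KMeans

rng = np.random.default_rng(0)
# Three loose blobs of 2-D points around (0,0), (5,5), and (0,5)
points = np.vstack([
    rng.normal(loc=(0, 0), scale=0.5, size=(30, 2)),
    rng.normal(loc=(5, 5), scale=0.5, size=(30, 2)),
    rng.normal(loc=(0, 5), scale=0.5, size=(30, 2)),
])

km = KMeans(n_clusters=3, n_init=10, random_state=0).fit(points)
print(km.cluster_centers_)   # three centroids, one per cluster
print(km.labels_[:5])        # cluster assignment of the first five points
```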
Q 26 / 92
1.
aggregated trees
2.
boosted trees
3.
bagged trees
4.
stacked trees
Q 27 / 92
1.
semi-supervised learning
2.
supervised learning
3.
reinforcement learning
4.
unsupervised learning
Q 28 / 92
1.
In K-means clustering, the initial centroids are sometimes randomly selected.
2.
K-means clustering is often used in supervised machine learning.
3.
The number of clusters are always randomly selected.
4.
To be accurate, you want your centroids outside of the cluster.
Q 29 / 92
1.
supervised learning
2.
semi-supervised learning
3.
reinforcement learning
4.
unsupervised learning
Q 30 / 92
1.
random forest
2.
logistic regression
3.
KNN
4.
deep neural network
Q 31 / 92
1.
Higher K values will produce noisy data.
2.
Higher K values lower the bias but increase the variance.
3.
Higher K values need a larger training set.
4.
Higher K values lower the variance but increase the bias.
Q 32 / 92
1.
supervised learning
2.
unsupervised learning
3.
reinforcement learning
4.
semi-unsupervised learning
Q 33 / 92
1.
It uses unsupervised learning to cluster together transactions and supervised learning to classify the customers.
2.
It uses only unsupervised machine learning.
3.
It uses supervised learning to create clusters and unsupervised learning for classification.
4.
It uses reinforcement learning to classify the customers.
Q 34 / 92
1.
high variance and low bias
2.
low bias and low variance
3.
low variance and high bias
4.
high bias and high variance
Q 35 / 92
1.
No, data model bias and variance are only a challenge with reinforcement learning.
2.
Yes, data model bias is a challenge when the machine creates clusters.
3.
Yes, data model variance trains the unsupervised machine learning algorithm.
4.
No, data model bias and variance involve supervised learning.
Q 36 / 92
**Explanation:** Logistic regression is far better than linear regression at binary classification since it biases the result toward one extreme or the other. K-means clustering can be used for classification but is not as accurate in most scenarios.
1.
K-means
2.
Logistic regression
3.
Linear regression
4.
Principal Component Analysis (PCA)
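The explanation above says logistic regression biases the result toward one extreme or the other; that behavior comes from the sigmoid function. A stdlib-only sketch of the idea (the scores are made up for illustration):

```python
# Sketch: logistic regression squashes an unbounded linear score through the
# sigmoid, pushing predictions toward 0 or 1 -- which suits binary classification.
import math

def sigmoid(z):
    """Map any real-valued score into the open interval (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))

# A plain linear model's score can be any real number...
for score in (-4.0, 0.0, 4.0):
    p = sigmoid(score)
    print(f"linear score {score:+.1f} -> class probability {p:.3f}")
# ...but the sigmoid biases it toward one extreme or the other,
# which is why logistic beats linear regression for binary classification.
```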
Q 37 / 92
**Explanation**: This one is pretty straightforward and a fundamental concept.
1.
supervised learning
2.
data
3.
unsupervised learning
4.
algorithms
Q 38 / 92
1.
It will take too long for programmers to scrub poor data.
2.
If the data is high quality, the algorithms will be easier to develop.
3.
Low-quality data requires much more processing power than high-quality data.
4.
If the data is low quality, you will get inaccurate results.
Q 39 / 92
1.
share common characteristics
2.
be part of the root node
3.
have a Euclidean connection
4.
be part of the same cluster
Q 40 / 92
1.
Reinforcement machine learning
2.
unsupervised machine learning
3.
supervised machine learning
4.
semi-supervised machine learning
Q 41 / 92
**Explanation**: This one is similar to an example talked about in the Stanford Machine Learning course.
1.
You will be able to prioritize different classes of drugs, such as antibiotics.
2.
You can create a training set of drugs you would like to discover.
3.
The algorithms will cluster together drugs that have similar traits.
4.
Human experts can create classes of drugs to help guide discovery.
Q 42 / 92
1.
The system went from supervised learning to reinforcement learning.
2.
The system evolved from supervised learning to unsupervised learning.
3.
The system evolved from unsupervised learning to supervised learning.
4.
The system evolved from reinforcement learning to unsupervised learning.
Q 43 / 92
1.
It could better protect against undiscovered threats.
2.
It would very likely lower the hardware requirements.
3.
It would substantially shorten your development time.
4.
It would increase the speed of the appliance.
Q 44 / 92
1.
Use reinforcement learning to reward the system when a new person participates.
2.
Use unsupervised machine learning to cluster together people based on patterns the machine discovers.
3.
Use supervised machine learning to sort people by demographic data.
4.
Use supervised machine learning to classify people by body temperature.
Q 45 / 92
1.
statistics
2.
structured data
3.
availability
4.
algorithms
Q 46 / 92
![Machine Learning Q45](images/machine-learning_Q45.jpg)
1.
unsupervised learning
2.
complex cluster
3.
multiclass classification
4.
K-nearest neighbor
Q 47 / 92
1.
deep learning artificial neural network that relies on petabytes of data
2.
unsupervised machine learning system that clusters together the best candidates
3.
Not recommend machine learning for this project
4.
supervised machine learning system that classifies applicants into existing groups // we do not need to identify the best candidates; we just need to classify job applicants into existing categories
Q 48 / 92
1.
regression analysis
2.
unsupervised learning
3.
high-variance modeling
4.
ensemble modeling
Q 49 / 92
1.
machine learning algorithm
2.
training set
3.
big data test set
4.
data cluster
Q 50 / 92
1.
You are overfitting the model to the data
2.
You need a smaller training set
3.
You are underfitting the model to the data
4.
You need a larger training set
Q 51 / 92
1.
an unsupervised machine learning system that clusters together the best candidates.
2.
you would not recommend a machine learning system for this type of project.
3.
a deep learning artificial neural network that relies on petabytes of employment data.
4.
a supervised machine learning system that classifies applicants into existing groups.
Q 52 / 92
1.
You use it as your training set.
2.
You label it big data.
3.
You split it into a training set and test set.
4.
You use it as your test set.
Q 53 / 92
1.
semi-supervised machine learning
2.
supervised machine learning
3.
unsupervised machine learning
4.
reinforcement learning
Q 54 / 92
1.
Batch learning
2.
Offline learning
3.
Both A and B
4.
None of the above
Q 55 / 92
1.
Decision Tree
2.
Linear Regression
3.
PCA
4.
Naive Bayesian
Q 56 / 92
1.
Decision Trees
2.
K-means clustering
3.
Density-based clustering
4.
Model-based clustering
Q 57 / 92
1.
The entropy function.
2.
The squared error.
3.
The cross-entropy function.
4.
The number of mistakes.
Q 58 / 92
1.
Higher
2.
Same
3.
Lower
4.
It could be any of the above
Q 59 / 92
1.
good fitting
2.
overfitting
3.
underfitting
4.
all of the above
Q 60 / 92
![Machine Learning Q58](images/machine-learning_Q58.jpg) **Explanation**: Shows data being classified into more than two categories or classes. Thus, this is a multi-class classification challenge.
1.
This is a multiclass classification challenge.
2.
This is a multi-binary classification challenge.
3.
This is a binary classification challenge.
4.
This is a reinforcement classification challenge.
Q 61 / 92
`Underfitted data models usually have high bias and low variance. Overfitted data models have low bias and high variance.`
1.
There is too little data in your training set.
2.
There is too much data in your training set.
3.
There is not a lot of variance but there is a high bias.
4.
Your model has low bias but high variance.
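The note above pairs underfitting with high bias / low variance and overfitting with low bias / high variance. A small sketch of that symptom (synthetic data, NumPy assumed available): fit polynomials of increasing degree and compare training error with held-out error:

```python
# Fit polynomials of increasing degree to noisy samples of a sine curve and
# compare train vs. held-out MSE. Synthetic data, purely for illustration.
import numpy as np

rng = np.random.default_rng(1)
x = np.linspace(0, 1, 40)
y = np.sin(2 * np.pi * x) + rng.normal(scale=0.2, size=x.size)
x_tr, y_tr = x[::2], y[::2]      # every other point for training
x_te, y_te = x[1::2], y[1::2]    # the rest held out for testing

def mse(degree):
    """Return (train MSE, test MSE) for a polynomial fit of the given degree."""
    coeffs = np.polyfit(x_tr, y_tr, degree)
    err = lambda xs, ys: float(np.mean((np.polyval(coeffs, xs) - ys) ** 2))
    return err(x_tr, y_tr), err(x_te, y_te)

for degree in (1, 4, 15):
    train_err, test_err = mse(degree)
    print(f"degree {degree:2d}: train {train_err:.3f}  test {test_err:.3f}")
# degree 1 underfits: high error on BOTH sets (high bias, low variance);
# degree 15 overfits: tiny train error, larger test error (low bias, high variance).
```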
Q 62 / 92
`The answer is self-explanatory: if Asian users are the only group of people making the complaint, then the training data should have more Asian faces.`
1.
Include Asian faces in your test data and retrain your model.
2.
Retrain your model with updated hyperparameter values.
3.
Retrain your model with smaller batch sizes.
4.
Include Asian faces in your training data and retrain your model.
Q 63 / 92
**Explanation**: // This question is very similar to Q49 but involves a polar opposite scenario. `That answer is somewhat vague and unsettled. A small number of matches does not necessarily imply that the model overfits, especially given 500 (!) independent variables. It sounds more reasonable that the threshold (matching) criterion might be too tight, thus allowing only a small number of matches to occur. So a solution could be either softening the threshold criterion or increasing the number of candidates.`
1.
Your training set is too large.
2.
You are underfitting the model to the data.
3.
You are overfitting the model to the data.
4.
Your machine is creating inaccurate clusters.
Q 64 / 92
1.
What kernels extract
2.
Feature Maps
3.
How kernels Look
Q 65 / 92
![image](images/machine-learning_Q62.png)
1.
76%
2.
88%
3.
12%
4.
0.0008%
Q 66 / 92
1.
Wise fill-in of controlled random values
2.
Replace missing values with averaging across all samples
3.
Remove defective samples
4.
Imputation
Q 67 / 92
1.
SVM
2.
PCA
3.
LDA
4.
TSNE
Q 68 / 92
1.
Capturing complex non-linear patterns
2.
Transforming continuous values into "ON" (1) or "OFF" (0) values
3.
Help avoiding the vanishing/exploding gradient problem
4.
Their ability to activate each neuron individually.
Q 69 / 92
1.
Kullback-Leibler (KL) loss
2.
Binary Crossentropy
3.
Mean Squared Error (MSE)
4.
Any L2 loss
Q 70 / 92
![image](images/machine-learning_Q67.png)

| no. | Red | Blue | Green |
| --- | --- | --- | --- |
| **1.** | Validation error | Training error | Test error |
| **2.** | Training error | Test error | Validation error |
| **3.** | Optimal error | Validation error | Test error |
| **4.** | Validation error | Training error | Optimal error |
1.
1
2.
2
3.
3
4.
4
Q 71 / 92
`// these nodes decide whether someone goes to the beach or not; for example, if it's rainy, people will mostly refrain from going to the beach`
1.
tree nodes
2.
predictors
3.
root nodes
4.
deciders
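The comment above describes tree nodes deciding whether someone goes to the beach. A toy hand-rolled sketch of that decision tree (the predictors and thresholds are invented for illustration):

```python
# Toy decision tree for the beach example: each internal node tests one
# predictor (weather, temperature), and the leaves make the final decision.
def go_to_beach(weather: str, temp_c: int) -> bool:
    if weather == "rainy":      # root node: rain keeps most people home
        return False
    if temp_c < 20:             # next predictor: too cold for the beach
        return False
    return True                 # otherwise: off to the beach

print(go_to_beach("rainy", 30))   # False
print(go_to_beach("sunny", 25))   # True
```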
Q 72 / 92
1.
Set up a cluster of machines to label the images
2.
Create a subset of the images and label them yourself
3.
Use naive Bayes to automatically generate labels.
4.
Hire people to manually label the images
Q 73 / 92
![image](images/machine-learning_Q70.png) `// since the model classifies the data accurately and neither overfits nor underfits the dataset`
1.
low bias, high variance
2.
high bias, low variance
3.
high bias, high variance
4.
low bias, low variance
Q 74 / 92
1.
structured data
2.
algorithms
3.
time
4.
computer scientists
Q 75 / 92
1.
Scikit-learn
2.
PyTorch
3.
TensorFlow Lite
4.
TensorFlow
Q 76 / 92
1.
a spreadsheet
2.
20,000 recorded voicemail messages
3.
100,000 images of automobiles
4.
hundreds of gigabytes of audio files
Q 77 / 92
1.
confidence
2.
alpha
3.
power
4.
significance
Q 78 / 92
1.
naive Bayes classifier
2.
K-nearest neighbor
3.
multiclass classification
4.
decision tree
Q 79 / 92
1.
when the machine learning algorithms do most of the programming
2.
when you don't do any data scrubbing
3.
when the learning happens continuously
4.
when you run your computation in one big instance at the beginning
Q 80 / 92
**Explanation**: Q-learning is a model-free reinforcement learning algorithm. It is a values-based learning algorithm: value-based algorithms update the value function based on an equation (in particular, the Bellman equation).
1.
supervised machine learning with rewards
2.
a type of unsupervised learning that relies heavily on a well-established model
3.
a type of reinforcement learning where accuracy degrades over time
4.
a type of reinforcement learning that focuses on rewards
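The explanation above describes Q-learning as a values-based algorithm driven by a Bellman-style update. A minimal tabular sketch (the corridor environment and hyperparameters are invented for illustration):

```python
# Minimal tabular Q-learning: a 5-cell corridor where the agent moves left or
# right and receives a reward of 1 for reaching the rightmost cell.
import random

random.seed(0)
N_STATES, ACTIONS = 5, (-1, +1)        # move left or move right
alpha, gamma, epsilon = 0.5, 0.9, 0.2  # learning rate, discount, exploration
Q = {(s, a): 0.0 for s in range(N_STATES) for a in ACTIONS}

for _ in range(500):                   # training episodes
    s = 0
    while s != N_STATES - 1:
        # epsilon-greedy action selection
        if random.random() < epsilon:
            a = random.choice(ACTIONS)
        else:
            a = max(ACTIONS, key=lambda act: Q[(s, act)])
        s2 = min(max(s + a, 0), N_STATES - 1)
        r = 1.0 if s2 == N_STATES - 1 else 0.0
        # Bellman update: nudge Q toward reward + discounted best future value
        Q[(s, a)] += alpha * (r + gamma * max(Q[(s2, b)] for b in ACTIONS) - Q[(s, a)])
        s = s2

# After training, moving right should score higher than moving left.
print(Q[(3, 1)], Q[(3, -1)])
```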
Q 81 / 92
1.
They would be grouped tightly together around the predicted outcome.
2.
They would be grouped tightly together but far from the predicted outcome.
3.
They would be scattered around the predicted outcome.
4.
They would be scattered far away from the predicted outcome.
Q 82 / 92
1.
unsupervised learning
2.
semi-supervised learning
3.
supervised learning
4.
semi-reinforcement learning
Q 83 / 92
1.
binary learning
2.
supervised learning
3.
unsupervised learning
4.
reinforcement learning
Q 84 / 92
`// You could use a naïve Bayes algorithm to differentiate three classes of dog breeds: terrier, hound, and sport dogs. Each class has three predictors: hair length, height, and weight. The algorithm does something called class predictor probability.`
1.
multiclass binary classification
2.
naive Bayes
3.
unsupervised classification
4.
decision tree analysis
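The comment above walks through a naive Bayes dog-breed example with three classes and three predictors. A sketch of that setup using Gaussian naive Bayes (scikit-learn assumed available; the measurements are made up):

```python
# Gaussian naive Bayes with three breed classes and three predictors
# (hair length, height, weight). Training numbers are invented for illustration.
import numpy as np
from sklearn.naive_bayes import GaussianNB

# columns: hair length (cm), height (cm), weight (kg)
X = np.array([
    [6, 25, 6],  [7, 28, 7],  [5, 24, 5],    # terrier
    [2, 60, 30], [3, 63, 32], [2, 58, 28],   # hound
    [4, 55, 25], [5, 57, 27], [4, 53, 24],   # sport
])
y = ["terrier"] * 3 + ["hound"] * 3 + ["sport"] * 3

clf = GaussianNB().fit(X, y)
dog = [[6, 26, 6]]                 # short hair, small, light: looks like a terrier
print(clf.predict(dog))            # most probable class
print(clf.predict_proba(dog))      # per-class predictor probabilities
```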
Q 85 / 92
1.
Create training clusters
2.
Remove predictors
3.
Use reinforcement learning
4.
Add more predictors
Q 86 / 92
1.
Suggest that the team is probably underfitting the model to the data.
2.
Suggest that unsupervised learning will lead to more interesting results.
3.
Make sure that they are picking the correct machine learning algorithms.
4.
Encourage the team to ask more interesting questions.
Q 87 / 92
1.
Unstructured data is always text.
2.
Unstructured data is much easier to store.
3.
Structured data has clearly defined data types.
4.
Structured data is much more popular.
Q 88 / 92
1.
Using an unsupervised machine learning algorithm to cluster together all the photographs.
2.
Create a data lake with an unsupervised machine learning algorithm.
3.
Use a combination of unsupervised and supervised machine learning to create machine-defined data clusters.
4.
Use supervised machine learning to classify photographs based on a predetermined training set.
Q 89 / 92
1.
the under/over challenge
2.
balance between clustering classification
3.
bias-variance trade-off
4.
the multiclass training set challenge
Q 90 / 92
1.
the probability that doing one thing has an impact on another thing
2.
the probability that certain conditions are met
3.
the probability that, based on certain conditions, something will always be incorrect
4.
the probability of something being the correct answer
Q 91 / 92
1.
conditional
2.
multiclass
3.
independent
4.
binary
Q 92 / 92