quaxp-1-final-1
Year
Month
city
state
country
shape
duration (seconds)
duration (hours/min)
comments
date posted
latitude
longitude
quaxp-1-final-2
Year
Month
state
country
shape
duration (seconds)
latitude
longitude
Select the true statements about this histogram:
Select the true statements about this boxplot:
In the UFO dataset, the attribute country contains 11791 missing values, on a total of 87512 instances (13% of missing values). What can you do?
The attribute “duration (seconds)” contains 7037 wrong values (values filled with 0). The attribute “duration (hours/min)” looks like the picture above. Is it possible to recover the values of the attribute “duration (seconds)”? You want to build a model to predict the shape of the UFO. Which statements are true?
You want to build a model to predict the shape of a UFO. Which attributes do you keep for the prediction?
We are now in possession of a dataset containing information about some wines. The goal is to predict the attribute “points” (which is a grade for the wine in percent, from 0 to 100) from its price. This graph is the results of the prediction vs. true label.
Which statements are true about the graph?
With the same wine problem as in the previous question, here is the description of the two attributes (price is the predictive attribute, points is the predicted attribute).
Would a MAE of 2,25 (for the attribute points) be considered good?
This is the list of the attributes in another wine dataset. All the attributes are numerical, except for the categorical attribute “type”, which is composed of the labels “red” and “white”.
We want to predict the attribute “quality” (a grade between 0 and 10), based on the other attributes. On which attribute could we split the dataset to try to increase the performance of the model?