Air Foil (UCI, Regression, n=1503, d=5)¶
Loading The Data¶
In [1]:
from kxy_datasets.uci_regressions import AirFoil # pip install kxy_datasets
In [2]:
dataset = AirFoil()
df = dataset.df # Retrieve the dataset as a pandas dataframe
y_column = dataset.y_column # The name of the column corresponding to the target
problem_type = dataset.problem_type # 'regression' or 'classification'
In [3]:
df.kxy.describe() # Visualize a summary of the data
-----------------------
Column: Angle of Attack
-----------------------
Type: Continuous
Max: 22
p75: 9.9
Mean: 6.8
Median: 5.4
p25: 2.0
Min: 0.0
--------------------
Column: Chord Length
--------------------
Type: Continuous
Max: 0.3
p75: 0.2
Mean: 0.1
Median: 0.1
p25: 0.1
Min: 0.0
------------------------------
Column: Displacement Thickness
------------------------------
Type: Continuous
Max: 0.1
p75: 0.0
Mean: 0.0
Median: 0.0
p25: 0.0
Min: 0.0
-----------------
Column: Frequency
-----------------
Type: Continuous
Max: 20,000
p75: 4,000
Mean: 2,886
Median: 1,600
p25: 800
Min: 200
----------------------
Column: Sound Pressure
----------------------
Type: Continuous
Max: 140
p75: 129
Mean: 124
Median: 125
p25: 120
Min: 103
----------------
Column: Velocity
----------------
Type: Continuous
Max: 71
p75: 71
Mean: 50
Median: 39
p25: 39
Min: 31
Data Valuation¶
In [4]:
df.kxy.data_valuation(y_column, problem_type=problem_type)
[====================================================================================================] 100% ETA: 0s Duration: 0s
Out[4]:
Achievable R-Squared | Achievable Log-Likelihood Per Sample | Achievable RMSE | |
---|---|---|---|
0 | 0.80 | 8.03e-01 | 3.09 |
Automatic (Model-Free) Variable Selection¶
In [5]:
df.kxy.variable_selection(y_column, problem_type=problem_type)
[====================================================================================================] 100% ETA: 0s Duration: 0s
Out[5]:
Variable | Running Achievable R-Squared | Running Achievable RMSE | |
---|---|---|---|
Selection Order | |||
0 | No Variable | 0.00 | 6.90 |
1 | Frequency | 0.30 | 5.76 |
2 | Displacement Thickness | 0.73 | 3.61 |
3 | Chord Length | 0.75 | 3.43 |
4 | Velocity | 0.80 | 3.09 |
5 | Angle of Attack | 0.80 | 3.09 |