95 lines
3.4 KiB
ReStructuredText
95 lines
3.4 KiB
ReStructuredText
Wine Data Database
|
|
====================
|
|
|
|
Notes
|
|
-----
|
|
Data Set Characteristics:
|
|
:Number of Instances: 178 (50 in each of three classes)
|
|
:Number of Attributes: 13 numeric, predictive attributes and the class
|
|
:Attribute Information:
|
|
- 1) Alcohol
|
|
- 2) Malic acid
|
|
- 3) Ash
|
|
- 4) Alcalinity of ash
|
|
- 5) Magnesium
|
|
- 6) Total phenols
|
|
- 7) Flavanoids
|
|
- 8) Nonflavanoid phenols
|
|
- 9) Proanthocyanins
|
|
- 10)Color intensity
|
|
- 11)Hue
|
|
- 12)OD280/OD315 of diluted wines
|
|
- 13)Proline
|
|
- class:
|
|
- class_0
|
|
- class_1
|
|
- class_2
|
|
|
|
:Summary Statistics:
|
|
|
|
============================= ==== ===== ======= =====
|
|
Min Max Mean SD
|
|
============================= ==== ===== ======= =====
|
|
Alcohol: 11.0 14.8 13.0 0.8
|
|
Malic Acid: 0.74 5.80 2.34 1.12
|
|
Ash: 1.36 3.23 2.36 0.27
|
|
Alcalinity of Ash: 10.6 30.0 19.5 3.3
|
|
Magnesium: 70.0 162.0 99.7 14.3
|
|
Total Phenols: 0.98 3.88 2.29 0.63
|
|
Flavanoids: 0.34 5.08 2.03 1.00
|
|
Nonflavanoid Phenols: 0.13 0.66 0.36 0.12
|
|
Proanthocyanins: 0.41 3.58 1.59 0.57
|
|
Colour Intensity: 1.3 13.0 5.1 2.3
|
|
Hue: 0.48 1.71 0.96 0.23
|
|
OD280/OD315 of diluted wines: 1.27 4.00 2.61 0.71
|
|
Proline: 278 1680 746 315
|
|
============================= ==== ===== ======= =====
|
|
|
|
:Missing Attribute Values: None
|
|
:Class Distribution: class_0 (59), class_1 (71), class_2 (48)
|
|
:Creator: R.A. Fisher
|
|
:Donor: Michael Marshall (MARSHALL%PLU@io.arc.nasa.gov)
|
|
:Date: July, 1988
|
|
|
|
This is a copy of UCI ML Wine recognition datasets.
|
|
https://archive.ics.uci.edu/ml/machine-learning-databases/wine/wine.data
|
|
|
|
The data is the results of a chemical analysis of wines grown in the same
|
|
region in Italy by three different cultivators. There are thirteen different
|
|
measurements taken for different constituents found in the three types of
|
|
wine.
|
|
|
|
Original Owners:
|
|
|
|
Forina, M. et al, PARVUS -
|
|
An Extendible Package for Data Exploration, Classification and Correlation.
|
|
Institute of Pharmaceutical and Food Analysis and Technologies,
|
|
Via Brigata Salerno, 16147 Genoa, Italy.
|
|
|
|
Citation:
|
|
|
|
Lichman, M. (2013). UCI Machine Learning Repository
|
|
[http://archive.ics.uci.edu/ml]. Irvine, CA: University of California,
|
|
School of Information and Computer Science.
|
|
|
|
References
|
|
----------
|
|
(1)
|
|
S. Aeberhard, D. Coomans and O. de Vel,
|
|
Comparison of Classifiers in High Dimensional Settings,
|
|
Tech. Rep. no. 92-02, (1992), Dept. of Computer Science and Dept. of
|
|
Mathematics and Statistics, James Cook University of North Queensland.
|
|
(Also submitted to Technometrics).
|
|
|
|
The data was used with many others for comparing various
|
|
classifiers. The classes are separable, though only RDA
|
|
has achieved 100% correct classification.
|
|
(RDA : 100%, QDA 99.4%, LDA 98.9%, 1NN 96.1% (z-transformed data))
|
|
(All results using the leave-one-out technique)
|
|
|
|
(2)
|
|
S. Aeberhard, D. Coomans and O. de Vel,
|
|
"THE CLASSIFICATION PERFORMANCE OF RDA"
|
|
Tech. Rep. no. 92-01, (1992), Dept. of Computer Science and Dept. of
|
|
Mathematics and Statistics, James Cook University of North Queensland.
|
|
(Also submitted to Journal of Chemometrics).
|