Exploratory data analysis
Encyclopedia
|
| Tutorials | Encyclopedia | Dictionary | Directory |
|
Exploratory data analysis
Exploratory data analysis (EDA) is an approach to analyzing data for the purpose of formulating hypotheses worth testing, complementing the tools of conventional statistics for testing hypotheses[1]. It was so named by John Tukey.
EDA developmentTukey held that too much emphasis in statistics was placed on statistical hypothesis testing (confirmatory data analysis); more emphasis needed to be placed on using data to suggest hypotheses to test. In particular, he held that confusing the two types of analyses and employing them on the same set of data can lead to systematic bias owing to the issues inherent in testing hypotheses suggested by the data. The objectives of EDA are to:
Tukey's books were notoriously opaque, and so several attempts were made to popularise his EDA ideas. Prominent among these was the Statistics in Society (MDST242) course of The Open University. Many EDA techniques have been adopted into data mining and are being taught to young students as a way to introduce them to statistical thinking.[2] TechniquesThere are a number of tools that are useful for EDA, but EDA is characterized more by the attitude taken than by particular techniques.[3] The principal graphical techniques used in EDA are: The principal quantitative techniques are: Graphical and quantitative techniques are: HistoryMany EDA ideas can be traced back to earlier authors, for example:
The Open University course Statistics in Society (MDST 242), took the above ideas and merged them with Gottfried Noether's work, which introduced statistical inference via coin-tossing and the median test. For details of the above, see John Bibby's book HOTS: History of Teaching Statistics. Software
See also
Bibliography
Notes
References
External links
de:Explorative Statistik es:Análisis exploratorio de datos (prehistoria) ko:??? ?? ?? pl:Eksploracja danych pt:Análise exploratória de dados Source: Wikipedia | The above article is available under the GNU FDL. | Edit this article
|
|
top
©2008-2009 TutorGig.com. All Rights Reserved. Privacy Statement