数据挖掘过程Data Mining
数据挖掘是什么?
Models and patterns from massive observational data sets.
Components of Data Mining Algorithms
Components of data mining algorithms
Representation
Determining the nature and structure of the representatiom to be used.
Score function
Quantifying and comparing how well different represenatations fit the data
Search/optimization method
Data management
Measurement:scale
Nominal:类型的,即特性只能通过name来区分Hair color
Ordinal:attributes can be ordered.无法做减法
Preference
只能做保持序列的映射
Interval:距离有意义
Temperature
Ratio:绝对零点,之间可以做比值
Weight,income
Why measurements are important?
数据结构数据类型数据结构都在讲允许对这些数据做什么操作.