Professional Documents
Culture Documents
Lecture 1
Jan. 24, 2017
Kate Cowles
374 SH, 335-0727
kate-cowles@uiowa.edu
Learning from data
https://lagunita.stanford.edu/c4x/HumanitiesScience/
StatLearning/asset/introduction.pdf
input: binary-coded image
output: classication as one of 10 digits
0 0 0 0 0 0 0 0 0 0 0 1 1 1 1 1
0 0 0 0 0 0 0 0 0 1 1 1 1 0 0 0
0 0 0 0 0 0 0 1 1 1 1 0 0 0 0 0
0 0 0 0 0 0 1 1 0 0 0 0 0 0 0 0
0 0 0 0 1 1 1 1 0 0 0 0 0 0 0 0
0 0 0 1 1 0 0 0 0 0 0 0 0 0 0 0
0 0 1 1 1 1 1 1 0 0 0 0 0 0 0 0
0 1 1 1 1 1 1 1 1 0 0 0 0 0 0 0
1 1 1 1 1 0 0 0 1 1 1 0 0 0 0 0
1 1 1 0 0 0 0 0 0 1 1 0 0 0 0 0
1 1 1 0 0 0 0 0 0 1 1 0 0 0 0 0
1 1 0 0 0 0 0 0 0 1 1 0 0 0 0 0
1 1 0 0 0 0 0 0 1 1 0 0 0 0 0 0
1 1 0 0 0 0 0 1 1 0 0 0 0 0 0 0
1 1 1 1 1 1 1 1 1 0 0 0 0 0 0 0
0 0 1 1 1 0 0 0 0 0 0 0 0 0 0 0, 6
X Y
Statistics Independent variables Dependent variables
Predictors Responses
Covariates
Machine learning Inputs Outputs
Pattern recognition Features
Supervised learning { more on data
Machine Statistical
learning learning
Parent discipline Articial Statistics
intelligence (c.s)
Emphases Large scale; Models (relationships)
Prediction Uncertainty
accuracy
Stamey, T., Kabalin, J., McNeal, J., Johnstone, I., Freiha, F.,
Redwine, E. and Yang, N (1989) Prostate specic antigen in the
diagnosis and treatment of adenocarcinoma of the prostate II.
Radical prostatectomy treted patients, Journal of Urology 16:
1076-1083.
included in R package ElemStatLearn
variables available before surgery
{ age in years
{ gleason the gleason score based on biopsy; integer values from
2 (not likely to spread quickly) to 10 (likely to be aggressive)
{ lpsa log of PSA test value
variables available only after prostate is removed
{ lweight log prostate weight
{ lbph log of the amount of benign prostatic hyperplasia
{ pgg45 percent of Gleason grade 4 or 5
{ lcavol log cancer volume
{ svi seminal vesicle invasion (if yes, cancer has spread outside
of prostate)
{ lcp log of capsular penetration (how much has has tumor
extended through the capsule surrounding the prostate)
variable added for analysis purposes
{ train logical; training or test data