19 T.W. Alexander Drive P.O. Box 14006 Research Triangle Park, NC 27709-4006 Tel: 919.685.9350 Fax: 919.685.9360 [email protected] |
|||||||||||||||||||||||||||||||||||||||||||||||||||||||
Technology Transfer Short CourseData Mining and Machine LearningJuly 25-29, 2005at the NISS/SAMSI building
General Information
General InformationSAMSI is instituting a new summer activity—technology transfer short courses designed to consolidate results from (in most cases, earlier years') SAMSI programs, and to make the results available to working professionals in a compact, hands-on format. The first such course is derived from the 2003-04 SAMSI program on Data Mining and Machine Learning (DMML). The goals of the DMML technology transfer short course are to:
The theoretical component will emphasize ideas over rigor; the software component will sample the major techniques that are now commonly used for visualization, classification, and regression; and the applications component will walk participants through the practical analysis of some famous real-world data sets. The structure of the short course is that there will be three hours of lecture each morning. Each afternoon will start with a 90 minute computer lab that goes over an application using real data and relevant software, followed by a 90 minute lecture by a guest speaker. There will be several breaks during the day. The course begins with an introductory overview of data mining: its scope, classical approaches, and the heuristics that guided the initial development of theory and methods. Then the course moves towards the treatment of more modern issues such as boosting, overcompleteness, and large-p small-n problems. This leads to a survey of currently popular techniques, including random forests, support vector machines, wavelets, and PAC bounds. The main focus is upon a central focus of the SAMSI DMML program—regression inference, a paradigm that informs many data mining applications, but we also discuss clustering, classification, and multidimensional scaling. The prerequisites for the course are a basic knowledge of applied multivariate inference and a general level of statistical knowledge comparable to a master's degree. Any math will focus upon conveying general insight rather than specific details.
Course Contents
Principal instructor for the course will be David L. Banks, Professor of the Practice of Statistics and Decision Sciences at Duke University, and co-leader of the SAMSI DMML program.
ApplicationREGISTRATION IS NOW CLOSED
Schedule
|
|
Entire site © 2001-2008, Statistical and Applied Mathematical Sciences Institute. All Rights Reserved. |
|
|
|
|
|
| |