Statistical and Applied Mathematical Sciences Institute
19 T. W. Alexander Drive
P.O. Box 14006
Research Triangle Park, NC 27709-4006
Tel: 919.685.9350 FAX: 919.685.9360
info@samsi.info

 

SAMSI UNDERGRADUATE WORKSHOP

Data Mining: Handling the Flood of Data

February 13-14, 2004

 

By almost any measure, data generation capabilities exceed and are growing faster than data analysis capabilities. Gigabyte-sized data sets are common, terabyte-sized data sets exist, and petabyte-sized data sets are on the way. Over the past fifteen years, motivated by problems such as speech recognition, credit card fraud, genetics and homeland security, the fields of data mining and machine learning (DMML) have emerged to attempt to cope with the flood of data.

This workshop will introduce undergraduates to DMML using adaptive, interactive demonstrations. It will feature multiple problem contexts, including bioinformatics (drug discovery), information technology (software engineering) and automobile sales data. Both underlying concepts, some of which are quite simple despite the extreme computational demands and current research frontiers, such as privacy preserving data mining, will be covered.

Interested juniors and seniors in mathematics, statistics and computer science in accredited US colleges and universities, especially women and members of under-represented minorities, are encouraged to apply. Since the purpose of the workshop is to introduce undergraduates to this new emerging field no previous exposure to data mining or machine learning is necessary.


As part of its Education and Outreach Program for 2003-2004, SAMSI will conduct a series of two day undergraduate workshops on topics of current interest in statistics and applied mathematics. In addition to an overview of current and planned SAMSI Research Programs, a featured topic will be covered in some depth.

The second of these workshops will be held on February 13-14, 2004 at SAMSI and will focus on Data Mining. The program will begin at 9:30 AM on Friday, February 13, and will be completed by 12 noon on Saturday, February 14. Participants are urged to arrive on Thursday evening and will be able to begin their return home by 12 noon on Saturday.


Instructions to Registrants

To register for the workshop, use the on-line registration form. Applications will be considered beginning January 1, 2004 and continuing until all participant openings are filled. Registrations received by January 22, 2004 will be given full consideration. Upon acceptance for the program, individuals must confirm by e-mail within seven days their intention to participate and also provide details of their travel plans.


 

Data Mining and Machine Learning Home Page

SAMSI Home Page

 

Entire site © 2001-2003, Statistical and Applied Mathematical Sciences Institute. All Rights Reserved.