Opening Workshop, Massive Datasets Program - September 9-12, 2012
Workshop Information
General Information
The Opening Workshop will be held Sunday-Wednesday, 9-12 September 2012, at the Radisson Hotel Research Triangle Park, NC. The hotel is in close proximity to SAMSI.
Tentative Schedule
Sunday, September 9, 2012
Radisson RTP
| 8:30-9:00 | Registration and Continental Breakfast |
| 8:50-9:00 | Welcome and Introduction Ilse Ipsen, N.C. State University/SAMSI |
| Tutorials | |
| 9:00-10:00 | Tamas Budavari, Johns Hopkins University Statistical Methods in Astronomy |
| 10:00-10:30 | Break |
| 10:30-11:30 | Petros Drineas, Rensselaer Polytechnic Institute Mining Massive Datasets: A (randomized) Linear Algebraic Perspective |
| 11:30-1:00 | Lunch |
| 1:00-2:00 | Haesun Park, Georgia Institue of Technology Visual Analytics for Knowledge Discovery in High Dimensional Data |
| 2:00-2:30 | Break |
| 2:30-3:30 | Stephen Wright, University of Wisconsin Optimization Techniques for Statistical Analysis on Large Datasets |
| 3:30-4:00 | Break |
| 4:00-5:00 | Michael Jordan, Univ. of California-Berkeley Resampling Methods for Massive Data |
Monday, September 10, 2012
Radisson RTP
| 8:30-8:55 | Registration and Continental Breakfast |
| 8:55-9:00 | Welcome |
| Session: Inference | |
| 9:00-9:45 | Bin Yu, University of California, Berekeley |
| 9:45-10:30 | Jonathan Taylor, Stanford University |
| 10:30-11:00 | Break |
| 11:00-11:45 | Brian Caffo, Johns Hopkins University |
| 11:45-12:15 | Panel Chair: Bill Eddy, Carnegie Mellon University |
| 12:15-1:30 | Lunch |
| Session: Imaging | |
| 1:30-2:15 | Jim Nagy, Emory University |
| 2:15-3:00 | Jianqing Fan, Princeton University |
| 3:00-3:30 | Break |
| 3:30-4:15 | Rollin Thomas, Lawrence Berkeley National Lab |
| 4:15-4:45 | Panel Co-Chairs: Daniela Ushizima, Lawrence Berkeley National Lab Jiayang Sun, Case Western Reserve |
| 4:45-5:15 | Poster blitz (2 minutes per poster) |
| 5:15-5:30 | Break |
| 5:30-7:30 | Poster Session and Reception SAMSI will provide poster presentation boards and tape. The board dimensions are 4 ft. wide by 3 ft. high. They are tri-fold with each side being 1 ft. wide and the center 2 ft. wide. Please make sure your poster fits the board. The boards can accommodate up to 16 pages of paper measuring 8.5 inches by 11 inches. |
Tuesday, September 11, 2012
Radisson RTP
| 8:30-9:00 | Registration and Continental Breakfast |
| Session: Environment & Climate | |
| 9:00-9:45 | Anna Michalak, Stanford University |
| 9:45-10:30 | Dan Crichton, Jet Propulsion Lab |
| 10:30-11:00 | Break |
| 11:00-11:45 | Noel Cressie, Ohio State University |
| 11:45-12:15 | Panel: Chair: Jessica Matthews, CICS-NC Panelists: Amy Braverman, Jet Propulsion Lab, Steve Sain, NCAR, Richard Smith, SAMSI/UNC-CH |
| 12:15-1:30 | Lunch |
| Session: High Energy Physics | |
| 1:30-2:15 | Steffen Bass, Duke University |
| 2:15-3:00 | Kyle Cranmer, Duke University |
| 3:00-3:30 | Break |
| 3:30-4:15 | Luc Demortier, Rockefeller University |
| 4:15-4:45 | Panel |
| 4:45-6:00 | Open Mic and Refreshments |
Wednesday, September 12, 2012
Radisson RTP
| 8:30-9:00 | Registration and Continental Breakfast |
| Session: Streaming, Sketching & Datamining | |
| 9:00-9:45 | Michael Mahoney, Stanford University |
| 9:45-10:30 | Maryann Fazel, University of Washington |
| 10:30-11:00 | Break |
| 11:00-11:45 | Inderjit Dhillon, University of Texas, Austin |
| 11:45-12:15 | Panel Chair: Piotr Indyk, MIT |
| 12:15-1:30 | Lunch |
| Working Groups | |
| 1:30-3:00 | Working Group Formation and Initial Meeting |
| 3:00-3:30 | Break |
| 3:30-4:30 | Working Group Reports |
Partial list of research topics:
* Data visualization and analytics:
High-speed visualization of high-dimensional datasets; data representation, extraction, integration and transformation; real-time visual interaction; spatio-temporal data mining
* Online streaming and sketching:
Algorithm paradigms for massive datasets (streaming, online, randomized); scalability; filtering; anomaly detection; data structures for fast computation of statistics; database enabled machine learning tools; computing environments and programming models (GPU's, cloud computing, custom chips)
* Large-scale optimization:
Convex optimization (sparse modeling and compressed sensing, matrix completion); online optimization (streaming data, on-line learning, control theory); distributed optimization (parallel and GPU computation, data fusion); machine learning; high-dimensional models
* Inference:
Dimension reduction for high-dimensional data (feature selection, sub-sampling and screening, sparse PCA); predictive inference and multiple testing (false discovery rates, uncertainty in prediction); high-dimensional MCMC methods for posterior inference (particle filters, hybrids with optimization methods)
* Imaging:
Rapid registration and segmentation (GPU's, distributed computing); multiple testing and inference for large-scale imaging data (sky surveys, satellite images, false discovery rate with dependence); dynamic imaging (streaming data, spatio-temporal models)
* Systems and architectures :
Reliability; resilience; probabilistic computing, multiple precision; real-time methods; variable data flows; hardware platforms
* High-energy physics:
Reconstruction and analysis of particle collisions from the LHC; pattern recognition and parameter extraction; simulations to estimate error rates; parameter estimation for large numbers of parameters; maximum likelihood estimators
* Astronomy:
Statistics on remote resources; computations on special purpose architectures and GPUs; communication avoiding methods; randomized and online algorithms; detection and classification of transient events and outliers; Bayesian inference and machine learning; high dimensional models with empirical priors; non-parametric models; visualization of large high-dimensional datasets
* Environment and climate:
Production, validation, processing, distribution and integration of data; data fusion and remote sensing; algorithms for large distributed datasets; spatial or spatio-temporal statistics
APPLICATION
To begin the application process login and register, then click on "Apply for Workshop" button at the top of this page. When you have completed the form, be sure to click on the green button.
This form also includes the application for financial support. You will be notified as soon as possible after your application if your participation will be possible; regrettably, limited seating will preclude acceptance of all applications. New researchers (graduate students, postdocs, and faculty in the early stages of their careers) and members of underrepresented groups are especially encouraged to apply.
The application/registration deadline is August 10, 2012. Note, however, that capacity might be reached before this deadline. Also, any application that requests support for expenses should typically be submitted at least 6 weeks in advance of the workshop.
Please make reservations at the Radisson RTP as soon as possible. The SAMSI room block and rate ($91) is effective until August 10, 2012. After this date, there is no guarantee a room will be available. If you have a change in plans, individual room reservations must be cancelled 72 hours prior to arrival. Check-in is at 3:00 PM; check-out is 12:00 noon.
If you need further information please send an email to md@samsi.info.
