Courses: Fall 2017/Spring 2018
Courses numbered 600 or above (such as Stat 610a) are intended primarily for graduate students. If such a course does not have an undergraduate cross-listing, undergraduates need special permission to enroll. A different course summary page is available here.
Director of Undergraduate Studies: Dan Spielman and Sekhar Tatikonda.
Director of Graduate Studies: John Emerson and Andrew Barron
Course requirements for Ph.D. and M.A. students
STAT 100b / STAT 500b Introductory Statistics
An introduction to statistical reasoning. Topics include numerical and graphical summaries of data, data acquisition and experimental design, probability, hypothesis testing, confidence intervals, correlation and regression. Application of statistical concepts to data; analysis of real-world problems.
STAT 242b / STAT 542b Theory of Statistics
Study of the principles of statistical analysis. Topics include maximum likelihood, sampling distributions, estimation, confidence intervals, tests of significance, regression, analysis of variance, and the method of least squares. Some statistical computing.
STAT 251b / STAT 551b/ENAS 496 Stochastic Processes
Introduction to the study of random processes, including Markov chains, Markov random fields, martingales, random walks, Brownian motion, and diffusions. Techniques in probability, such as coupling and large deviations. Applications chosen from image reconstruction, Bayesian statistics, finance, probabilistic analysis of algorithms, and genetics and evolution.
STAT 330b / STAT 600b / MATH 330b Advanced Probability
Measure theoretic probability, conditioning, laws of large numbers, convergence in distribution, characteristic functions, central limit theorems, martingales. Some knowledge of real analysis is assumed.
STAT 361b / STAT 661b Data Analysis
Selected topics in statistics explored through analysis of data sets using the R statistical computing language. Topics include linear and nonlinear models, maximum likelihood, resampling methods, curve estimation, model selection, classification, and clustering. Weekly sessions in the Statistical Computing laboratory.
After STAT 242 and MATH 222 or 225, or equivalents.
STAT 363b / STAT 660b Multivariate Statistics for Social Sciences
Introduction to the analysis of multivariate data as applied to examples from the social sciences. Topics include principal components analysis, factor analysis, cluster analysis (hierarchical clustering, k-means), discriminant analysis, multidimensional scaling, and structural equations modeling. Extensive computer work using either SAS or SPSS programming software.
Prerequisites: knowledge of basic inferential procedures and experience with linear models.
STAT 364b / STAT 664b Information Theory
Foundations of information theory in mathematical communications, statistical inference, statistical mechanics, probability, and algorithmic complexity. Quantities of information and their properties: entropy, conditional entropy, divergence, redundancy, mutual information, channel capacity. Basic theorems of data compression, data summarization, and channel coding. Applications in statistics and finance.
After Statistics 241.
STAT 365b / STAT 665b Data Mining and Machine Learning
Techniques for data mining and machine learning are covered from both a statistical and a computational perspective, including support vector machines, bagging, boosting, neural networks, and other nonlinear and nonparametric regression methods. The course will give the basic ideas and intuition behind these methods, a more formal understanding of how and why they work, and opportunities to experiment with machine learning algorithms and apply them to data. After STAT 242b.
STAT 480ab Individual Studies
Directed individual study for qualified students who wish to investigate an area of statistics not covered in regular courses. A student must be sponsored by a faculty member who sets the requirements and meets regularly with the student. Enrollment requires a written plan of study approved by the faculty adviser and the director of undergraduate studies.
Permission required. No final Exam.
STAT 490b Statistics Senior Seminar Project
Under the supervision of a member of the faculty, each student works on an independent project. Students participate in seminar meetings at which they speak on the progress of their projects.
Permission required. No final Exam.
STAT 100b / STAT 500b Introductory Statistics
An introduction to statistical reasoning. Topics include numerical and graphical summaries of data, data acquisition and experimental design, probability, hypothesis testing, confidence intervals, correlation and regression. Application of statistical concepts to data; analysis of real-world problems.
STAT 242b / STAT 542b Theory of Statistics
Study of the principles of statistical analysis. Topics include maximum likelihood, sampling distributions, estimation, confidence intervals, tests of significance, regression, analysis of variance, and the method of least squares. Some statistical computing.
STAT 251b / STAT 551b/ENAS 496 Stochastic Processes
Introduction to the study of random processes, including Markov chains, Markov random fields, martingales, random walks, Brownian motion, and diffusions. Techniques in probability, such as coupling and large deviations. Applications chosen from image reconstruction, Bayesian statistics, finance, probabilistic analysis of algorithms, and genetics and evolution.
STAT 330b / STAT 600b / MATH 330b Advanced Probability
Measure theoretic probability, conditioning, laws of large numbers, convergence in distribution, characteristic functions, central limit theorems, martingales. Some knowledge of real analysis is assumed.
STAT 361b / STAT 661b Data Analysis
Selected topics in statistics explored through analysis of data sets using the R statistical computing language. Topics include linear and nonlinear models, maximum likelihood, resampling methods, curve estimation, model selection, classification, and clustering. Weekly sessions in the Statistical Computing laboratory.
After STAT 242 and MATH 222 or 225, or equivalents.
STAT 363b / STAT 660b Multivariate Statistics for Social Sciences
Introduction to the analysis of multivariate data as applied to examples from the social sciences. Topics include principal components analysis, factor analysis, cluster analysis (hierarchical clustering, k-means), discriminant analysis, multidimensional scaling, and structural equations modeling. Extensive computer work using either SAS or SPSS programming software.
Prerequisites: knowledge of basic inferential procedures and experience with linear models.
STAT 364b / STAT 664b Information Theory
Foundations of information theory in mathematical communications, statistical inference, statistical mechanics, probability, and algorithmic complexity. Quantities of information and their properties: entropy, conditional entropy, divergence, redundancy, mutual information, channel capacity. Basic theorems of data compression, data summarization, and channel coding. Applications in statistics and finance.
After Statistics 241.
STAT 365b / STAT 665b Data Mining and Machine Learning
Techniques for data mining and machine learning are covered from both a statistical and a computational perspective, including support vector machines, bagging, boosting, neural networks, and other nonlinear and nonparametric regression methods. The course will give the basic ideas and intuition behind these methods, a more formal understanding of how and why they work, and opportunities to experiment with machine learning algorithms and apply them to data. After STAT 242b.
STAT 611b Selected Topics in Statistical Decision Theory
In this course we will review some recent developments in statistical decision theory including nonparametric estimation, high dimensional (non)linear estimation, low rank and sparse matrices estimation, covariance matrices estimation, graphical models, and network analysis.
Prerequisite: Statistics 610.
STAT 626b Practical Work
Individual one-semester projects, with students working on studies outside the Department, under the guidance of a statistician. This course is a one-credit requirement for the Ph.D. degree.
STAT 627ab Statistical Consulting
Statistical consulting and collaborative research projects often require statisticians to explore new topics outside their area of expertise. This course exposes students to real problems, requiring them to draw on their expertise in probability, statistics, and data analysis. Students complete the course with individual projects supervised jointly by faculty outside the department and by one of the instructors. Students enroll for both terms and receive one credit at the end of the year.
STAT 645b / BIS 692b / CB&B 645b Statistical Methods in Genetic and Bioinformatics
Introduction to problems, algorithms, and data analysis approaches in computational biology and bioinformatics; stochastic modeling and statistical methods applied to problems such as mapping disease-associated genes, analyzing gene expression microarray data, sequence alignment, and SNP analysis. Statistical methods include maximum likelihood, EM, Bayesian inference, Markov chain Monte Carlo, and some methods of classification and clustering; models include hidden Markov models, Bayesian networks, and the coalescent. The limitations of current models, and the future opportunities for model building, are critically addressed.
STAT 654b Topics in Bayesian Inference and Data Analysis
Topics in the theory and practice of Bayesian statistical inference, ranging from a review of fundamentals to questions of current research interest. Motivation for the Bayesian approach, Bayesian computation, Monte Carlo methods, use of software (including R, BUGS, and possibly others), asymptotics, model checking and comparison, empirical Bayes approaches, hierarchical models, and Bayesian nonparametrics. A selection of other topics as time permits; possibilities include Bayesian design, variational methods, and approximate Bayesian computation. Assumed background includes probability and statistics at least at the level of STAT 541 and 542, Markov Chains as covered in STAT 551, and computing in R.
STAT 662b Statistical Computing
Topics in the practice of data analysis and statistical computing, with particular attention to problems involving massive data sets or large, complex simulations and computations. Programming with R, C/C++, and Python; computational efficiency, memory management, interactive and dynamic graphics, and parallel computing.
STAT 674b / F&ES 781b Applied Spatial Statistics
An introduction to spatial statistical techniques with computer applications. Topics include spatial sampling, visualizing spatial data, quantifying spatial association and autocorrelation, interpolation methods, fitting variograms, kriging, and related modeling techniques for spatially correlated data. Examples are drawn from ecology, sociology, public health, and subjects proposed by students. Four to five lab/homework assignments and a final project. The class makes extensive use of the R programming language as well as ArcGIS.
STAT 690ab Independent Study
By arrangement with faculty. Approval of DGS required.
STAT 699ab Research Seminar in Probability
Continuation of the Yale Probability Group Seminar. Student and faculty explanations of current research in areas such as random graph theory, spectral graph theory, Markov chains on graphs, and the objective method.
Credit only with the explicit permission of the seminar organizers.
STAT 700ab Departmental Seminar
Important activity for all members of the department. See webpage for weekly seminar announcements.