Statistics is the science and art of prediction and explanation. The mathematical foundation of statistics lies in the theory of probability, which is applied to problems of making inferences and decisions under uncertainty. Practical statistical analysis also uses a variety of computational techniques, methods of visualizing and exploring data, methods of seeking and establishing structure and trends in data, and a mode of questioning and reasoning that quantifies uncertainty.
Data Science expands on Statistics to encompass the entire lifecycle of data, from its specification, gathering and cleaning, through its management and analysis, to its use in making decisions and setting policy. It is a natural outgrowth of Statistics that incorporates advances in Machine Learning, Data Mining and High-Performance Computing along with domain expertise in the Social Sciences, Natural Sciences, Engineering, Management, Medicine and Digital Humanities.