Data science is the discovery of knowledge and insight through the analysis of data. As such, it draws on the study of algorithms and their implementation from computer science, the power of abstraction and of geometric and topological formalism from mathematics, and the modeling and analysis of data from statistics. It has emerged as a separate field in response to the avalanche of data from web-enabled sensors and instrumentation, mobile devices, web logs and transactions, and to the availability of computing power for data storage and analysis. Modern data is challenging not only because of its large scale, but also because it is increasingly heterogeneous and unstructured. Nonetheless, information gleaned from this data is revolutionizing diverse areas of human endeavor, from health policy to high-energy physics.