This event has ended. View the official site or create your own event → Check it out
This event has ended. Create your own
View analytic
Friday, November 11 • 2:10pm - 2:50pm
Scala: The unpredicted lingua franca for data science

Sign up or log in to save this to your schedule and see who's attending!

It was true until pretty recently that data scientists’ languages of choice to manipulate and make sense out of data were Python, R, or MATLAB, which led to split in the data science community and duplication of efforts in languages offering similar sets of functionality. Then distributed technologies came out of the blue, most using a convenient and easy-to-deploy platform, the JVM. Data scientists are now part of heterogeneous teams that face many problems and must work toward global solutions together, including a new responsibility to be productive and agile in order to have their work integrated into platforms. This is why technologies like Apache Spark are so important and are gaining this traction from different communities. And even though some bindings are available for legacy languages, all the creative, new ways to analyze data are done in Scala. Using a fully productive and reproducible environment combining the Spark Notebook and Docker, Xavier Tordoir explore what it means to do data science today and why Scala succeeds at coping with large and fast data where older languages fail. Xavier then introduce and summarize all the new methodologies and scientific advances in machine learning that use Scala as the main language, including Splash, mic-cut problem, OptiML, needle (DL), ADAM, and more, and demonstrate how these programs work for data scientists by enabling interactivity, live reactivity, charting capabilities, and robustness in Scala—things that were still missing from the legacy languages.

avatar for Xavier Tordoir

Xavier Tordoir

Founder, Data Fellas, Inc.
Xavier started his career as a researcher in Experimental Physics and also focused on data processing. Further down the road, he took part in projects in finance, genomics and software development for academic research. During that time, he worked on timeseries, on prediction of biological molecular structures and interactions, and applied Machine Learning methodologies. He developed solutions to manage and process data distributed across data... Read More →

Friday November 11, 2016 2:10pm - 2:50pm
Off by One

Attendees (36)