Loading…
Scalæ By the Bay has ended
Back To Schedule
Friday, November 11 • 9:50am - 10:10am
Introduction to Big Data and Spark

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

This is an introductory talk for those who want to get into Big Data and learn about Spark, but don't know where to start. Spark is a fast easy-to-use general-purpose cluster computing framework for processing large datasets. It has become the most active open-source big data project. It is hotter than Hadoop was a few years ago. The talk will start with an introduction to Big Data and the challenges associated with it. Next, Mohammed will dive into Spark and talk about how it can be used to solve those challenges. In addition, he will discuss the following: a) Why Spark has set the Big Data world on fire b) Why people are replacing Hadoop MapReduce with Spark c) What kind of applications benefit from Spark d) How Spark works (high-level architecture) Finally, he will introduce the key libraries that come pre-packaged with Spark and discuss how these libraries simplify a variety of analytical tasks, including: a) Batch processing b) Interactive ad hoc analytics c) Stream processing d) Graph analytics e) Machine learning

Speakers
avatar for Mohammed Guller

Mohammed Guller

Principal Architect, Glassbeam
Passionate about building new products, machine learning, and big data analytics. Built several products from the ground up. Author of Big Data Analytics with Spark.


Friday November 11, 2016 9:50am - 10:10am PST
Off by One