39K subscribers
698 videos
Engineering Fast Indexes for Big Data Applications: Spark Summit East talk by Daniel Lemire
Building Real Time BI Systems with Kafka, Spark & Kudu: Spark Summit East talk by Ruhollah Farchtchi
Spark Streaming Pushing the Throughput Limits, the Reactive Way
Fusing Apache Spark and Lucene for Near Realtime Predictive Model Building (Deb Das)
Using Spark and Elasticsearch for Real-time Data Analysis- Costin Leau (Elasticsearch)
Optimizing Spark Deployments for Containers: Isolation, Safety & Performance by William Benton
Magellan: Spark as a Geospatial Analytics Engine
Spark and Cassandra - Martin Van Ryswyk
Lessons Learned from Dockerizing Spark Workloads: Spark Summit East talk by Tom Phelan
Optimizing Apache Spark SQL Joins: Spark Summit East talk by Vida Ha
Lambda Architecture, Analytics and Data Pathways with Spark Streaming, Kafka, Akka and Cassandra
The Fast Path to Building Operational Applications with Spark: talk by Nikita Shamgunov
Building a modern data discovery and BI platform using Apache Spark and Catalyst with Kevin Beyer
Accelerating Machine Learning and Deep Learning At Scale With Apache Spark: talk by Ziya Ma
EclairJS = Node.js + Apache Spark
SparkSQL: A Compiler from Queries to RDDs: Spark Summit East talk by Sameer Agarwal
Machine Learning on Spark RoundTable
Spark Streaming and IoT
Clickstream Analysis with Spark—Understanding Visitors in Realtime
Debugging PySpark: Spark Summit East talk by Holden Karau
Building, Debugging, and Tuning Spark Machine Learning Pipelines - Joseph Bradley (Databricks)
Extending Spark with Java Agents (Jaroslav Bachorik)
Scientific Image Analysis Using Spark- Kevin Mader (ETH Zurich / Paul Scherrer Institut)
Using Spark and Riak for IoT Apps—Patterns and Anti Patterns: Spark Summit East talk by Pavel Hardak
Apache Toree: A Jupyter Kernel for Spark: Spark Summit East talk by Marius van Niekerk
Top 5 Mistakes When Writing Spark Applications
Spark on large Hadoop cluster and evaluation - Masaru Dobashi (NTT Data Corporation)
Lessons Learned From Running Spark On Docker
Why Spark on Hadoop Matters - M.C. Srivas
Interactive Visualization of Streaming Data Powered by Spark
Understanding Memory Management In Spark For Fun And Profit
Data Profiling and Pipeline Processing with Spark
Learnings Using Spark Streaming and DataFrames for Walmart Search: by Nirmal Sharma and Yan Zheng
Efficient State Management With Spark 2.0 And Scale Out Databases
Sparking Up Data Engineering: Spark Summit East talk by Rohan Sharma
Analytics at the Real Time Speed of Business: Spark Summit East talk by Manish Gupta
Effective Spark with Alluxio: Spark Summit East talk by Gene Pang and Haoyuan Li
Spark on YARN The Road Ahead- Marcelo Vanzin (Cloudera)
High Performance Python On Spark
Spark SQL: Another 16x Faster After Tungsten: Spark Summit East talk by Brad Carlile
Large Scale Text Processing Pipeline with Spark ML & GraphFrames: talk by Alexey Svyatkovskiy
What Lies Beneath Apache Spark's RDD API Using Spark shell and WebUI
A Web application for interactive data analysis with Spark - Romain Rigaux (Cloudera)
Production Spark and Tachyon use Cases
Training Data Science with Apache Spark
Spark Streaming: The State of the Union and the Road Beyond - Tathagata Das (Databricks)
Mobius: C# Language Binding For Spark
Apache Spark at Scale: A 60 TB+ Production Use Case (Sital Kedia)
Spark as the Gateway Drug to Typed Functional Programming: talk by Jeffrey Smith and Rohan Aletty