City Storage Systems (CSS) has rebuilt its machine learning infrastructure around Daft, a DataFrame library that integrates with Ray. This session covers how CSS moved its data processing and ETL workflows off traditional Spark clusters and onto a more unified, efficient Ray-based ecosystem.
Ammar Alrashed, Santosh Jha, and Garret Weaver showcase real-world applications from CloudKitchens, demonstrating how Daft's intuitive interface not only matches but often exceeds PySpark's capabilities. They explore the benefits of consolidating operations across individual and cluster environments, highlighting the synergy between Ray and Daft in supporting end-to-end machine learning pipelines. Attendees gain insights into practical strategies for streamlining data engineering and model development workflows, applicable to a wide range of data-intensive industries.
--
Interested in more?
Watch the full Day 1 Keynote: • Ray Summit 2024 Keynote Day 1 | Where...
Watch the full Day 2 Keynote: • Ray Summit 2024 Keynote Day 2 | Where...
Check out the Ray Summit Breakout sessions: • Ray Summit 2024 - Breakout Sessions
--
🔗 Connect with us:
Subscribe to our YouTube channel: @anyscale
Twitter: https://x.com/anyscalecompute
LinkedIn: joinanyscale
Website: https://www.anyscale.com