Name: Building Complex Data Analytics Pipelines with Ray - Qingqing Mao, Dascena
Start: 2020-10-01T16:50:00-0700
End: 2020-10-01T17:20:00-0700

View More Details for Ray Summit & Registration Information.
Please note: All Sessions are in Pacific Daylight Time (PDT), UTC-7

Back To Schedule

Building Complex Data Analytics Pipelines with Ray - Qingqing Mao, Dascena

Feedback form is now closed.

Building scalable data analytics pipelines is challenging, especially when different subtasks may have different computational requirements and interdependencies. It becomes more challenging when you need to serve enterprise customers who have strict data security and privacy policies and require on-premise deployment. The scaling requirement and computational capacity often vary widely from site to site.

We have been using Ray to build natural language processing pipelines and healthcare analysis pipelines. The highly efficient serialization using a shared-memory object store is a perfect fit for handling our data-intensive jobs. Ray helps us narrow the gap between data science and engineering, and it enables our data scientists to write high-performance and cost-efficient data analytics pipelines that can scale.

Speakers

Qingqing Mao

Head of Engineering and Data Science, Dascena

Qingqing Mao is the Head of Engineering and Data Science at Dascena, where he leads the development of compliant and scalable clinical data pipelines and the research on applying machine learning techniques in healthcare and medicine. Previously, he worked as a senior staff data scientist... Read More →

Building Complex Data Analytics Pipelines with Ray Qingqing Mao, Dascena pdf

Thursday October 1, 2020 4:50pm - 5:20pm PDT
Virtual 2

Natural Language Processing

Slides Included Yes

Ray Summit 2020

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Qingqing Mao