
About

Unifying Data Pipelines and Machine Learning with Apache Spark™ and Amazon SageMaker

The widespread adoption of Apache Spark™, the first unified analytics engine, has helped data professionals make great strides in data science and machine learning. Yet, their upstream data lakes still face reliability challenges when it comes to building production data pipelines at scale to power these initiatives.

 

Delta Lake is an open source storage layer that brings reliability to data lakes. Its reliability features include ACID transactions, scalable metadata handling, and unified streaming and batch data processing. It also offers DML commands to update, delete, and merge data throughout your data lifecycle, for example to meet GDPR/CCPA requirements. Delta Lake runs on top of your existing data lake, such as Azure Data Lake Storage, AWS S3, Hadoop HDFS, or on-premises storage, and is fully compatible with Apache Spark APIs.

 

Join this hands-on lab to learn how Delta Lake can help you build robust production data pipelines at scale. This event will give you the opportunity to:

 

  • Gain an understanding of the Delta Lake open source project
  • Learn how to build highly scalable and reliable data pipelines using Delta Lake
  • See Delta Lake in action with a demo and hands-on code walkthrough
  • Ask Databricks experts your most challenging data questions
  • Network and learn from your data engineering and data science peers



Space is limited! RSVP now to save your spot.


Every enterprise today wants to accelerate innovation by building Data and ML into its business. However, most companies struggle with preparing large datasets for analytics, managing the proliferation of Data and ML frameworks, and moving models from development to production.

 

In this virtual workshop, we’ll cover best practices for using powerful open source technologies to simplify and scale your Data and ML efforts. We’ll discuss how to leverage Apache Spark™, the de facto data processing and analytics engine in enterprises today, for data preparation, as it unifies data at massive scale across various sources. You’ll also learn how to use Data and ML frameworks (e.g., TensorFlow, XGBoost, and Scikit-Learn) to train models based on different requirements. Finally, you’ll learn how to use MLflow to track experiment runs across multiple users within a reproducible environment and to manage the deployment of models to production on Amazon SageMaker.

 

Join this virtual workshop to learn how Unified Data Analytics can bring data science, business analytics, and engineering together to accelerate your Data and ML efforts. This virtual workshop will give you the opportunity to:


  • Learn how to build highly scalable and reliable pipelines for analytics
  • Gain deeper insight into Apache Spark and Databricks, including the latest updates with Delta Lake
  • Train a model against data and learn best practices for working with ML frameworks (e.g., TensorFlow, XGBoost, and Scikit-Learn)
  • Learn about MLflow to track experiments, share projects and deploy models in the cloud with Amazon SageMaker


We will use Zoom for a virtual meeting environment. Your Zoom link will be sent to you upon registration. 

 

We look forward to seeing you on January 13th at 9:00 AM PT.


AWS Privacy Policy

Immuta Privacy Policy

 

Date & Time

01/13/2021, 9:00am - 11:00am PST
RSVPs Closed

Agenda

9:00 AM

Databricks Keynote & Immuta Keynote

9:15 AM

AstrumU Presentation

9:30 AM

Databricks Technical Follow Along Workshop

10:50 AM

Immuta Demo

11:00 AM

Prizes and Next Steps

Speaker Highlights

Matt Vogt

Sr. Director, Global Solution Architecture

Immuta

Kaj Pedersen

Chief Technology Officer

AstrumU

Naseer Ahamed

Partner Solutions Architect

Databricks


Privacy Policy

Terms of Use 


Thank you for registering for the AWS | Databricks ML Dev Day Live Workshop: Unifying Data Pipelines and Machine Learning with Apache Spark™ and Amazon SageMaker. 


Link to join live workshop

 
You should receive an email shortly with additional details.


For more information about what to expect at the event or to refer colleagues to attend, please visit the event site.


If you have any other questions at all, please email fieldmarketing@databricks.com.

Vish Gupta is a host of exceptional ability. Studies show that a vast majority of guests attending events by Vish have been known to leave more elated than visitors to Santa's Workshop, The Lost Continent of Atlantis, and the Fountain of Youth.

Sorry, RSVPs have closed.