Data Science with Juliet Hougland and Michelle Casbon

Juliet Hougland and Michelle Casbon are on the podcast this week to talk about data science with Melanie and Mark. We had a great discussion about methodology, applications, tools, pipelines, challenges and resources. Juliet shared insights into the unique data science ownership workflow from idea to deployment at Stitch Fix, and Michelle dove into how Kubeflow is playing a role to help drive reliability in model development and deployment. Juliet Hougland Juliet Hougland leads the Workflow, Environment, and Execution team at Stichfix. She is a data scientist and engineer with expertise in computational mathematics and years of hands-on machine learning and big data experience. She has built and deployed production ML models, advised Fortune 500 companies on infrastructure and worked on a variety of open source projects (Apache Spark, Scalding, and Kiji) at the intersection of big data and machine learning. Michelle Casbon Michelle Casbon is a Senior Engineer on the Google Cloud Platform Developer Relations team, where she focuses on open source contributions and community engagement for machine learning and big data tools. Prior to joining Google, she was at several San Francisco-based startups as a Senior Engineer and Director of Data Science. Within these roles, she built and shipped machine learning products on distributed platforms using both AWS and GCP. Michelle’s development experience spans more than a decade and has primarily focused on multilingual natural language processing, system architecture and integration, and continuous delivery pipelines for machine learning applications. She especially loves working with open source projects and is an active contributor to Kubeflow. Michelle holds a masters degree from the University of Cambridge. Cool things of the week Sandeep Dinesh: Kubernetes Best Practices YouTube CNCF TOC voted to accept Helm as an incubation-level hosted project to CNCF site Andriod P in Beta blog Agones 0.2.0 site Securing cloud-connected devices with Cloud IoT and Microchip blog Interview flotilla-os repo Kubeflow repo Cloud Dataproc site & docs Spark site & community site scikit-learn site xgboost repo PyTorch site TensorFlow site and github Kubernetes site github Introducing ultramem Google Compute Engine machine types blog #114 Machine Learning Bias and Fairness with Timnit Gebru and Margaret Mitchell podcast Machine Learning Flash Clards site Open Source Data Science Masters site DockerCon SF site Question of the week If I have written a gRPC Service, but I’m using a language/platform that isn’t supported - is there any way I can access it as REST? grpc-gateway Envoy proxy Transcoding Where can you find us next? Mark is speaking at the San Francisco Kubernetes Meetup: Scaling Game Servers and the Conduit Service Mesh on June 14th. Melanie is speaking at a joint WiMLDS and PyLadies event “Paths to Data Science” on June 26th and Stanford AI4ALL on June 28th.

Popout Listen on the new Podbay