Data Driven
Data Driven: the podcast where we explore the emerging field of Data Science. We bring the best minds in Data, Software Engineering, Machine Learning, and Artificial Intelligence right to you every Tuesday. The field of data science mashes up the worlds of statistics, database architecture, and software engineering. Data Scientist has been labelled by the Harvard Business Review as "the sexiest job of the 21st century." A quick search of job sites reveals that this field is in high demand. In a world where Data is the new Oil and Data Science the new Refineries, consider this Car Talk for the Data Age. Our goal is to educate and inspire our listeners so that they can be prepared to thrive in a Data Driven world.
Max Sklar on Exploring AI, Data Science, and Local Search
In today's episode, hosts Frank La Vigne and Andy Leonard are joined by Max Sklar, an expert in location data and machine learning. Max shares insights from his decade-long tenure at Foursquare, delving into the company's evolution, gamification features, the challenges faced in the local search space, and his early interest in location data. The conversation explores the enduring relevance of foundational tech concepts, the cyclical nature of technology trends, and Max's personal journey into data and machine learning. Max also discusses his podcast, "The Local Maximum," and his diverse interests, including abstract math papers and a project rewriting the US Constitution. Join us as we dive into a thought-provoking discussion about AI, data science, and the ever-evolving world of technology with Max Sklar.

Show Notes
00:00 Foursquare split, confused but loved the concept.
04:29 Rewards program failed due to lack of scalability.
08:44 Early career in New York City's tech boom.
13:05 Foursquare uses phone data to track locations.
16:25 Models analyzed data to improve sentiment analysis.
20:02 Data pipeline technology used for real-time deployment.
20:54 Python written code, comparing different languages used.
24:17 Navigating reinvention in a changing world.
29:38 Joined Wireless Generation, now known as Amplify, as a software engineer.
31:53 Machine learning brings data to life.
34:26 Using OpenAI API to create interactive content.
40:03 Technology enables limitless creativity and storytelling potential.
42:12 Enjoys volunteering in underserved communities around the world.
44:36 Extensive library and website featuring various projects.
47:48 Please subscribe, rate, and review our podcast.
Jan 3
48 min
Navigating the Complexity of Operationalizing ML Models
In this episode of Data Driven, our hosts Andy Leonard and Frank La Vigne are joined by Chris McDermott, VP of Engineering at Wallaroo.AI. Together, they explore the challenges and advancements in the ever-evolving world of machine learning and artificial intelligence.

From the importance of ongoing care for machine learning models to the rise of edge computing and decentralized networks, they touch on the critical need for flexibility and data privacy. Chris shares his insights on the technical challenges of AI and ML adoption, as well as his unique career journey. They also discuss the evolution of technology and the potential future impact of these innovations.

Join us for a deep dive into the world of AI, technology, and the future of machine learning with Chris McDermott on this episode of Data Driven.

Show Notes
00:00 Exploring AI, data science, and data engineering.
06:20 Training and inferring are different stages.
08:12 Legacy AI doesn't require neural networks or GPUs.
12:09 Machine learning models require consistent care and monitoring.
15:10 MLOps merges skills, breaks down silos, collaborates.
16:47 Prefer MLOps to avoid namespace collision. DevOps parallels original Star Wars plot.
20:27 Internet-scale operations require automation and resilience.
24:13 Challenges of integrating AI into business processes.
28:03 New push for edge computing in technology industry.
32:05 Edge technology critical, discussed in government tech symposium.
34:50 Navigating from SendGrid to Twilio simplified processes.
36:15 First foray into data, growing knowledge.
39:33 Technology evolves, builds complexity over time.
44:41 Book recommendation: "Seeing Like a State" by James C. Scott discusses legibility and centralization of power in society.
46:28 Predictable tree farming fails due to ecosystem complexity.

Speaker Bio
Chris McDermott is a software engineer and entrepreneur who is passionate about creating products that make machine learning more accessible and manageable for users. His focus is on developing a platform that allows for easy deployment and management of machine learning models using any framework and on any architecture or hardware. He believes that current solutions in the market force users into a specific platform, and he aims to provide a more flexible and efficient alternative. With a strong belief in the potential of his product, Chris is dedicated to making machine learning more accessible and user-friendly for people across various industries.
Dec 4, 2023
49 min
Advanced Fraud Prevention in the Age of Artificial Intelligence
In this episode, Andy and Frank sit down with Pavel Goldman-Kalaydin, head of Artificial Intelligence and Machine Learning at Sumsub, a global company specializing in KYC, AML, and anti-fraud technologies.

They explore the challenges in verifying identities remotely, the rise of deep fakes for fraud, and the use of AI and machine learning to combat these threats. From discussing the impact of technology on security measures to Pavel's journey in the field of computer science and AI, this episode offers insights into the evolving landscape of fraud detection and the intersection of technology, AI, and security.

Join us as we delve into the complexities of anti-fraud measures and the fascinating world of AI and machine learning.

Show Notes
00:00 Securing customer journey from onboarding to verification.
04:44 2 years ago, typical attack to open account.
06:58 German video identification process prolongs account opening.
12:16 Analyze data patterns to make informed decisions.
13:34 Questioning deep fake implications for customer data.
17:42 Advancing technology makes image manipulation easier.
22:32 Financial fraud: creating defects for unexpected reasons.
25:53 Fascinating progress in beta software development.
29:23 Samsung creates its own products, understands customers' needs.
29:58 Problem with defects, educate and ensure understanding.
34:01 Interest in drug development and AI technology.
38:57 Audible sponsors Data Driven with free audiobook.
41:05 Please rate and review our podcast.
Nov 29, 2023
41 min
Diving into Re:Invent 2023: Open Sourcing Dingo and Being in the Top 2.5 Percent
In this jam-packed episode, hosts Frank and Andy delve into a wide range of topics, from the chaos of podcast scheduling and the allure of Cyber Week deals, to the behind-the-scenes world of data engineering and AI professionals. Join us as we journey through the challenges of podcasting, the important roles of data engineers, and the potential open sourcing of Dingo, an innovative blogging automation tool. Along the way, the hosts share personal anecdotes, discuss legislative impacts, and even touch on cult-followed gas stations. You won't want to miss this delightful, informative, and always data-driven episode!

Show Notes
00:00 Glamorous world of podcasting and Microsoft Bookings.
13:48 Privacy laws are spreading globally, impacting data sovereignty.
27:14 Funny moment at Dunkin' Donuts sparks creativity.
32:27 Importance of data engineering in AI projects.
49:38 Struggling with hearing loss, amplifiers magnify all sounds.
01:02:45 Emotions on camera, times sidetrack, sarcastic leadership.
01:07:32 Excited to hang out at the mall.
01:21:04 Considering discontinuing blog after reaching 100 posts.
01:25:18 Wants to shift focus to new projects.
01:37:09 Transition from long-form to short-form content.
01:49:50 Drove up to Jersey for Christmas, reminisced.
01:58:48 Concerns about coastal development and zoning enforcement.

Links
01:02:45 Here's an example of early FWTV where I am at the mall and not happy about it: https://www.youtube.com/watch?v=f8S7ha9fZWo
Nov 28, 2023
2 hr 3 min
OpenAI Drama, Open Source, and Andy's New Venture
In this episode, your hosts Andy Leonard and Frank La Vigne dive headfirst into the world of open source, decision making, and the unfolding drama surrounding OpenAI. From sarcastic responses to holographic displays, we've got it all covered! Join us as we discuss the potential consequences of dependencies, community protests leading to change, and the recent issues with OpenAI. We'll also explore the importance of open source in AI and share some intriguing insights on Sam Altman's return to the company. With a sprinkle of tech industry gossip and even a potential Netflix adaptation, this episode is a must-listen. So sit back, relax, and get ready to be data driven!

Show Notes
02:42 OpenAI, Thanksgiving break, intense year, household name.
10:35 3-day conference with nightly events, pre-conference presentations.
14:09 NVIDIA, OpenAI, Elon Musk, open source.
21:07 "Doubts arise about OpenAI's dependence and transparency."
24:55 Regulations and transparency warranted for research.
29:57 OpenAI lacked options to protest, unlike Node.
36:52 Teams invite, alternative to costly Calendly.
42:04 Product shelved, lack of promotion, open source alternatives.
44:06 Insufficient hardware led to new AI venture.
48:55 Artists use online art to fight scraping.
55:37 Costs exceeded expectations, customers pulling back, database snapshot unavailable.
01:03:42 Happy Thanksgiving from the Data Driven Podcast.
Nov 22, 2023
1 hr 4 min
Two Hosts on Two Coasts
Andy is speaking at PASS Summit in Seattle and Frank is speaking at the Red Hat Government Symposium in Washington, DC.

Two hosts. Two coasts. One podcast!
Nov 15, 2023
5 min
Brennan Lamey on Entrepreneurship & Data Engineering in the Web 3 Era
Welcome back to another exciting episode of Data Driven! In this show, we delve into the fascinating world of Web 3 and decentralized databases. Join us as we explore the insights and experiences of our guest, Brennan Lamey, the founder of Kwil, a company that builds decentralized databases for Web 3 applications.

Throughout this episode, Brennan shares his journey and the inspiration behind Kwil, as well as the technology that powers their database solutions. From complex access control rules to collaboration between competitors, we uncover how Kwil is transforming the way companies approach data storage, privacy, and sharing.

But it's not just about the technology - we also dive into Brennan's personal story, from their humble beginnings in Idaho to their entrepreneurial success and passion for data engineering. Plus, don't miss their recommendations for AI programming and an intriguing sci-fi audiobook they're currently enthralled by.

So, whether you're a tech enthusiast, a data-driven professional, or simply curious about the future of the internet, this episode is a must-listen. Tune in as we unravel the intricacies of Web 3, decentralized databases, and the exciting possibilities they hold for a better, fairer online world. Let's get started on this illuminating journey with Brennan Lamey and Kwil in this episode of Data Driven!
Nov 6, 2023
54 min
BAILeY Celebrates Guy Fawkes Night
BAILeY recites the V-laden introductory speech from V for Vendetta.

Just for fun.

Transcript
Voilà! In view, a humble vaudevillian veteran, cast vicariously as both victim and villain by the vicissitudes of Fate. This visage, no mere veneer of vanity, is a vestige of the vox populi, now vacant, vanished. However, this valorous visitation of a by-gone vexation stands vivified, and has vowed to vanquish these venal and virulent vermin vanguarding vice and vouchsafing the violently vicious and voracious violation of volition.

The only verdict is vengeance; a vendetta, held as a votive, not in vain, for the value and veracity of such shall one day vindicate the vigilant and the virtuous.

Verily, this vichyssoise of verbiage veers most verbose, so let me simply add that it is my very good honor to meet you and you may call me V.
Nov 5, 2023
49 sec
New Year in November, Podcast Backlog, and Scheduling Issues
In this 344th episode of Data Driven, Frank talks about the New Year. Yes, New Year's in November. Frank and Andy just do their own thing.

It's part of their charm, or at least they told me to say that. I do take issue with Frank's statement that I am not real.

What is real, anyway?

It is not the spoon that bends, after all. [https://youtu.be/uAXtO5dMqEI?si=oQ_4M--xfcaarYyY&t=41]

In any case, Frank explains what held up the production schedule this summer and his plan to do right by you, our awesome listeners. They reflect on the challenges they've faced with production and scheduling this past year, and share some exciting updates on the podcast's progress. So grab your headphones and join us as we explore the world of data-driven insights and the intriguing stories behind them. Let's dive in!

Show Notes
[00:02:16] Concussion turned blessing, celebrating new year early.
[00:04:46] Copious caffeine fueled drive, editing delays, book deal dropped.
[00:08:38] Child brings joy, finding daycare challenging. Worth it.
[00:11:04] Interesting year, learned value of faith. Thankful for each other.
[00:16:16] We can improve promotion of our work.
[00:18:37] Grateful for a sweet kid's infectious joy.
[00:21:25] Longtime listeners thank you; show 344, hacker named Dwayne; scheduling issues.
[00:24:41] Automation is essential for efficiency and growth.
[00:27:35] AI transcription enables cost-effective podcast production.
[00:31:06] Great pumpkin conversation leads to metaverse plans.
[00:33:18] Servicing backlog, doing right by fans.
[00:36:43] Warren G's cover of Tina Turner's song.
[00:39:23] Impressed by DALL E two's coherence.
[00:44:23] Challenges arise with ChatGPT's capabilities and opinions.
[00:45:47] The text discusses the need for filters on language models to deal with real-life BS.
Nov 3, 2023
47 min
Nickolas Means on Software Engineering, Data Liability, and Good Coffee
In this episode, we have a fascinating conversation with Nickolas Means, the VP of software development at Sym. Nickolas shares his insights on software engineering, data liability, and of course, good coffee.

Nickolas starts off by sharing his love for audiobooks, particularly those narrated by the talented Wil Wheaton. He also recommends a management book called "Turn the Ship Around!" by L. David Marquet, which explores the importance of autonomy and ownership in improving performance.

The conversation then turns to the topic of shame in the software engineering industry. Nickolas emphasizes the impact of shame on silencing voices and discouraging vulnerability within teams. They discuss imposter syndrome and the subjective nature of judging someone's skills, delving into the Dunning-Kruger effect.

Drawing lessons from physical engineering disasters, Nickolas shares the importance of early recognition and admission of mistakes, highlighting the need for a blameless mindset in software engineering. They also explore the impact of organizational culture on agile processes and the value of implementing meaningful controls for compliance.

In addition to his expertise in software engineering, Nickolas shares his passion for pour-over coffee and reveals his obsession with perfecting his daily cup. So grab your favorite brew and join us for this engaging conversation on software engineering, data liability, and the pursuit of excellence. Let's dive into another thought-provoking episode of Data Driven!

Show Notes
[00:00:00] Nick Means discusses shame and software engineering.
[00:04:46] Loud voices silence others; vulnerability is key.
[00:09:16] What can we learn from physical engineering?
[00:10:01] Engineering disasters teach human error in steel.
[00:13:58] VP of software development interested in disasters.
[00:16:37] Learn, not blame. Safety 2 perspective.
[00:20:16] Big Agile vs. little a Agile explained.
[00:25:39] DevOps leads to improved engineering efficiencies and cost savings.
[00:29:25] Emergence of data regulations in government and industry.
[00:30:33] Spirit of law makes compliance easier, safer.
[00:35:51] Useless ash turned profitable by steel mills.
[00:38:34] Uncle's Amiga sparked love for computers.
[00:40:44] Increasingly humane tech interaction; a historic shift.
[00:45:35] Favorite narrators and management book recommendations.
[00:48:12] Intriguing episode of data-driven with Nick Means.
Nov 1, 2023
49 min