The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

By Sam Charrington

Listen to a podcast, please open Podcast Republic app. Available on Google Play Store.


Category: Tech News

Open in Apple Podcasts


Open RSS feed


Open Website


Rate for this podcast

Subscribers: 1590
Reviews: 3


 Dec 25, 2018

Elvis Alive
 Jul 20, 2018
Great podcast covering both business and technical aspects of ML and AI

A Podcast Republic user
 Jul 9, 2018

Description

Machine learning and artificial intelligence are dramatically changing the way businesses operate and people live. The TWIML AI Podcast brings the top minds and ideas from the world of ML and AI to a broad and influential community of ML/AI researchers, data scientists, engineers and tech-savvy business and IT leaders. Hosted by Sam Charrington, a sought after industry analyst, speaker, commentator and thought leader. Technologies covered include machine learning, artificial intelligence, deep learning, natural language processing, neural networks, analytics, computer science, data science and more.

Episode Date
Scaling AI at H&M Group with Errol Koolmeister - #503
40:46

Today we’re joined by Errol Koolmeister, the head of AI foundation at  H&M Group.

In our conversation with Errol, we explore H&M’s AI journey, including its wide adoption across the company in 2016, and the various use cases in which it's deployed like fashion forecasting and pricing algorithms. We discuss Errol’s first steps in taking on the challenge of scaling AI broadly at the company, the value-added learning from proof of concepts, and how to align in a sustainable, long-term way. Of course, we dig into the infrastructure and models being used, the biggest challenges faced, and the importance of managing the project portfolio, while Errol shares their approach to building infra for a specific product with many products in mind.

Jul 22, 2021
Evolving AI Systems Gracefully with Stefano Soatto - #502
47:40

Today we’re joined by Stefano Soatto, VP of AI applications science at AWS and a professor of computer science at UCLA. 

Our conversation with Stefano centers on recent research of his called Graceful AI, which focuses on how to make trained systems evolve gracefully. We discuss the broader motivation for this research and the potential dangers or negative effects of constantly retraining ML models in production. We also talk about research into error rate clustering, the importance of model architecture when dealing with problems of model compression, how they’ve solved problems of regression and reprocessing by utilizing existing models, and much more.

The complete show notes for this episode can be found at twimlai.com/go/502.

Jul 19, 2021
ML Innovation in Healthcare with Suchi Saria - #501
44:45

Today we’re joined by Suchi Saria, the founder and CEO of Bayesian Health, the John C. Malone associate professor of computer science, statistics, and health policy, and the director of the machine learning and healthcare lab at Johns Hopkins University. 

Suchi shares a bit about her journey to working in the intersection of machine learning and healthcare, and how her research has spanned across both medical policy and discovery. We discuss why it has taken so long for machine learning to become accepted and adopted by the healthcare infrastructure and where exactly we stand in the adoption process, where there have been “pockets” of tangible success. 

Finally, we explore the state of healthcare data, and of course, we talk about Suchi’s recently announced startup Bayesian Health and their goals in the healthcare space, and an accompanying study that looks at real-time ML inference in an EMR setting.

The complete show notes for this episode can be found at twimlai.com/go/501.

Jul 15, 2021
Cross-Device AI Acceleration, Compilation & Execution with Jeff Gehlhaar - #500
41:13

Today we’re joined by a friend of the show Jeff Gehlhaar, VP of technology and the head of AI software platforms at Qualcomm. 

In our conversation with Jeff, we cover a ton of ground, starting with a bit of exploration around ML compilers, what they are, and their role in solving issues of parallelism. We also dig into the latest additions to the Snapdragon platform, AI Engine Direct, and how it works as a bridge to bring more capabilities across their platform, how benchmarking works in the context of the platform, how the work of other researchers we’ve spoken to on compression and quantization finds its way from research to product, and much more! 

After you check out this interview, you can look below for some of the other conversations with researchers mentioned. 

The complete show notes for this episode can be found at twimlai.com/go/500.

Jul 12, 2021
The Future of Human-Machine Interaction with Dan Bohus and Siddhartha Sen - #499
46:57

Today we continue our AI in Innovation series joined by Dan Bohus, senior principal researcher at Microsoft Research, and Siddhartha Sen, a principal researcher at Microsoft Research. 

In this conversation, we use a pair of research projects, Maia Chess and Situated Interaction, to springboard us into a conversation about the evolution of human-AI interaction. We discuss both of these projects individually, as well as the commonalities they have, how themes like understanding the human experience appear in their work, the types of models being used, the various types of data, and the complexity of each of their setups. 

We explore some of the challenges associated with getting computers to better understand human behavior and interact in ways that are more fluid. Finally, we touch on what excites both Dan and Sid about their respective projects, and what they’re excited about for the future.  

The complete show notes for this episode can be found at https://twimlai.com/go/499.

Jul 08, 2021
Vector Quantization for NN Compression with Julieta Martinez - #498
39:34

Today we’re joined by Julieta Martinez, a senior research scientist at recently announced startup Waabi. 

Julieta was a keynote speaker at the recent LatinX in AI workshop at CVPR, and our conversation focuses on her talk “What do Large-Scale Visual Search and Neural Network Compression have in Common,” which shows that multiple ideas from large-scale visual search can be used to achieve state-of-the-art neural network compression. We explore the commonality between large databases and dealing with high dimensional, many-parameter neural networks, the advantages of using product quantization, and how that plays out when using it to compress a neural network. 

We also dig into another paper Julieta presented at the conference, Deep Multi-Task Learning for Joint Localization, Perception, and Prediction, which details an architecture that is able to reuse computation between the three tasks, and is thus able to correct localization errors efficiently.

The complete show notes for this episode can be found at twimlai.com/go/498.

Jul 05, 2021
Deep Unsupervised Learning for Climate Informatics with Claire Monteleoni - #497
40:30

Today we continue our CVPR 2021 coverage joined by Claire Monteleoni, an associate professor at the University of Colorado Boulder. 

We cover quite a bit of ground in our conversation with Claire, including her journey down the path from environmental activist to one of the leading climate informatics researchers in the world. We explore her current research interests, and the available opportunities in applying machine learning to climate informatics, including the interesting position of doing ML from a data-rich environment. 

Finally, we dig into the evolution of climate science-focused events and conferences, as well as the Keynote Claire gave at the EarthVision workshop at CVPR “Deep Unsupervised Learning for Climate Informatics,” which focused on semi- and unsupervised deep learning approaches to studying rare and extreme climate events.

The complete show notes for this episode can be found at twimlai.com/go/497.

Jul 01, 2021
Skip-Convolutions for Efficient Video Processing with Amir Habibian - #496
46:15

Today we kick off our CVPR coverage joined by Amir Habibian, a senior staff engineer manager at Qualcomm Technologies. 

In our conversation with Amir, whose research primarily focuses on video perception, we discuss a few papers they presented at the event. We explore the paper Skip-Convolutions for Efficient Video Processing, which looks at training discrete variables to end to end into visual neural networks. We also discuss his work on his FrameExit paper, which proposes a conditional early exiting framework for efficient video recognition. 

The complete show notes for this episode can be found at twimlai.com/go/496.

Jun 28, 2021
Advancing NLP with Project Debater w/ Noam Slonim - #495
51:01

Today we’re joined by Noam Slonim, the principal investigator of Project Debater at IBM Research. 

In our conversation with Noam, we explore the history of Project Debater, the first AI system that can “debate” humans on complex topics. We also dig into the evolution of the project, which is the culmination of 7 years and over 50 research papers, and eventually becoming a Nature cover paper, “An Autonomous Debating System,” which details the system in its entirety. 

Finally, Noam details many of the underlying capabilities of Debater, including the relationship between systems preparation and training, evidence detection, detecting the quality of arguments, narrative generation, the use of conventional NLP methods like entity linking, and much more.

The complete show notes for this episode can be found at twimlai.com/go/495.

Jun 24, 2021
Bringing AI Up to Speed with Autonomous Racing w/ Madhur Behl - #494
50:55

Today we’re joined by Madhur Behl, an Assistant Professor in the department of computer science at the University of Virginia. 

In our conversation with Madhur, we explore the super interesting work he’s doing at the intersection of autonomous driving, ML/AI, and Motorsports, where he’s teaching self-driving cars how to drive in an agile manner. We talk through the differences between traditional self-driving problems and those encountered in a racing environment, the challenges in solving planning, perception, control. 

We also discuss their upcoming race at the Indianapolis Motor Speedway, where Madhur and his students will compete for 1 million dollars in the world’s first head-to-head fully autonomous race, and how they’re preparing for it.

Jun 21, 2021
AI and Society: Past, Present and Future with Eric Horvitz - #493
41:42

Today we continue our  AI Innovation series joined by Microsoft’s Chief Scientific Officer, Eric Horvitz. 

In our conversation with Eric, we explore his tenure as AAAI president and his focus on the future of AI and its ethical implications, the scope of the study on the topic, and how drastically the AI and machine learning landscape has changed since 2009. We also discuss Eric’s role at Microsoft and the Aether committee that has advised the company on issues of responsible AI since 2017.

Finally, we talk through his recent work as a member of the National Security Commission on AI, where he helped commission a 750+ page report on topics including the Future of AI R&D, Building Trustworthy AI systems, civil liberties and privacy, and the challenging area of AI and autonomous weapons.  

The complete show notes for this episode can be found at twimlai.com/go/493.

Jun 17, 2021
Agile Applied AI Research with Parvez Ahammad - #492
43:52

Today we’re joined by Parvez Ahammad, head of data science applied research at LinkedIn.

In our conversation, Parvez shares his interesting take on organizing principles for his organization, starting with how data science teams are broadly organized at LinkedIn. We explore how they ensure time investments on long-term projects are managed, how to identify products that can help in a cross-cutting way across multiple lines of business, quantitative methodologies to identify unintended consequences in experimentation, and navigating the tension between research and applied ML teams in an organization. Finally, we discuss differential privacy, and their recently released GreyKite library, an open-source Python library developed to support forecasting.

The complete show note for this episode can be found at twimlai.com/go/492.

Jun 14, 2021
Haptic Intelligence with Katherine J. Kuchenbecker - #491
38:40

Today we’re joined Katherine J. Kuchenbecker, director at the Max Planck Institute for Intelligent Systems and of the haptic intelligence department. 

In our conversation, we explore Katherine’s research interests, which lie at the intersection of haptics (physical interaction with the world) and machine learning, introducing us to the concept of “haptic intelligence.” We discuss how ML, mainly computer vision, has been integrated to work together with robots, and some of the devices that Katherine’s lab is developing to take advantage of this research.

We also talk about hugging robots, augmented reality in robotic surgery, and the degree to which she studies human-robot interaction. Finally, Katherine shares with us her passion for mentoring and the importance of diversity and inclusion in robotics and machine learning. 

The complete show notes for this episode can be found at twimlai.com/go/491.

Jun 10, 2021
Data Science on AWS with Chris Fregly and Antje Barth - #490
38:59

Today we continue our coverage of the AWS ML Summit joined by Chris Fregly, a principal developer advocate at AWS, and Antje Barth, a senior developer advocate at AWS. 

In our conversation with Chris and Antje, we explore their roles as community builders prior to, and since, joining AWS, as well as their recently released book Data Science on AWS. In the book, Chris and Antje demonstrate how to reduce cost and improve performance while successfully building and deploying data science projects. 

We also discuss the release of their new Practical Data Science Specialization on Coursera, managing the complexity that comes with building real-world projects, and some of their favorite sessions from the recent ML Summit.

Jun 07, 2021
Accelerating Distributed AI Applications at Qualcomm with Ziad Asghar - #489
38:34

Today we’re joined by Ziad Asghar, vice president of product management for snapdragon technologies & roadmap at Qualcomm Technologies. 

We begin our conversation with Ziad exploring the symbiosis between 5G and AI and what is enabling developers to take full advantage of AI on mobile devices. We also discuss the balance of product evolution and incorporating research concepts, and the evolution of their hardware infrastructure Cloud AI 100, their role in the deployment of Ingenuity, the robotic helicopter that operated on Mars just last year. 

Finally, we talk about specialization in building IoT applications like autonomous vehicles and smart cities, the degree to which federated learning is being deployed across the industry, and the importance of privacy and security of personal data. 

The complete show notes can be found at https://twimlai.com/go/489.

Jun 03, 2021
Buy AND Build for Production Machine Learning with Nir Bar-Lev - #488
42:40

Today we’re joined by Nir Bar-Lev, co-founder and CEO of ClearML.

In our conversation with Nir, we explore how his view of the wide vs deep machine learning platforms paradox has changed and evolved over time, how companies should think about building vs buying and integration, and his thoughts on why experiment management has become an automatic buy, be it open source or otherwise. 

We also discuss the disadvantages of using a cloud vendor as opposed to a software-based approach, the balance between mlops and data science when addressing issues of overfitting, and how ClearML is applying techniques like federated machine learning and transfer learning to their solutions.


The complete show notes for this episode can be found at https://twimlai.com/go/488.

May 31, 2021
Applied AI Research at AWS with Alex Smola - #487
54:40

Today we’re joined by Alex Smola, Vice President and Distinguished Scientist at AWS AI.

We had the pleasure to catch up with Alex prior to the upcoming AWS Machine Learning Summit, and we covered a TON of ground in the conversation. We start by focusing on his research in the domain of deep learning on graphs, including a few examples showcasing its function, and an interesting discussion around the relationship between large language models and graphs. Next up, we discuss their focus on AutoML research and how it's the key to lowering the barrier of entry for machine learning research.

Alex also shares a bit about his work on causality and causal modeling, introducing us to the concept of Granger causality. Finally, we talk about the aforementioned ML Summit, its exponential growth since its inception a few years ago, and what speakers he's most excited about hearing from.

The complete show notes for this episode can be found at https://twimlai.com/go/487.

May 27, 2021
Causal Models in Practice at Lyft with Sean Taylor - #486
54:08

Today we’re joined by Sean Taylor, Staff Data Scientist at Lyft Rideshare Labs.

We cover a lot of ground with Sean, starting with his recent decision to step away from his previous role as the lab director to take a more hands-on role, and what inspired that change. We also discuss his research at Rideshare Labs, where they take a more “moonshot” approach to solving the typical problems like forecasting and planning, marketplace experimentation, and decision making, and how his statistical approach manifests itself in his work.

Finally, we spend quite a bit of time exploring the role of causality in the work at rideshare labs, including how systems like the aforementioned forecasting system are designed around causal models, if driving model development is more effective using business metrics, challenges associated with hierarchical modeling, and much much more.

The complete show notes for this episode can be found at twimlai.com/go/486.

May 24, 2021
Using AI to Map the Human Immune System w/ Jabran Zahid - #485
39:19

Today we’re joined by Jabran Zahid, a Senior Researcher at Microsoft Research.

In our conversation with Jabran, we explore their recent endeavor into the complete mapping of which T-cells bind to which antigens through the Antigen Map Project. We discuss how Jabran’s background in astrophysics and cosmology has translated to his current work in immunology and biology, the origins of the antigen map, the biological and how the focus was changed by the emergence of the coronavirus pandemic.

We talk through the biological advancements, and the challenges of using machine learning in this setting, some of the more advanced ML techniques that they’ve tried that have not panned out (as of yet), the path forward for the antigen map to make a broader impact, and much more.

The complete show notes for this episode can be found at twimlai.com/go/485.

May 20, 2021
Learning Long-Time Dependencies with RNNs w/ Konstantin Rusch - #484
36:15

Today we conclude our 2021 ICLR coverage joined by Konstantin Rusch, a PhD Student at ETH Zurich.

In our conversation with Konstantin, we explore his recent papers, titled coRNN and uniCORNN respectively, which focus on a novel architecture of recurrent neural networks for learning long-time dependencies.

We explore the inspiration he drew from neuroscience when tackling this problem, how the performance results compared to networks like LSTMs and others that have been proven to work on this problem and Konstantin’s future research goals.

The complete show notes for this episode can be found at twimlai.com/go/484.

May 17, 2021
What the Human Brain Can Tell Us About NLP Models with Allyson Ettinger - #483
36:33

Today we continue our ICLR ‘21 series joined by Allyson Ettinger, an Assistant Professor at the University of Chicago. 

One of our favorite recurring conversations on the podcast is the two-way street that lies between machine learning and neuroscience, which Allyson explores through the modeling of cognitive processes that pertain to language. In our conversation, we discuss how she approaches assessing the competencies of AI, the value of control of confounding variables in AI research, and how the pattern matching traits of Ml/DL models are not necessarily exclusive to these systems. 

Allyson also participated in a recent panel discussion at the ICLR workshop How Can Findings About The Brain Improve AI Systems?, centered around the utility of brain inspiration for developing AI models. We discuss ways in which we can try to more closely simulate the functioning of a brain, where her work fits into the analysis and interpretability area of NLP, and much more!

The complete show notes for this episode can be found at twimlai.com/go/483

May 13, 2021
Probabilistic Numeric CNNs with Roberto Bondesan - #482
39:31

Today we kick off our ICLR 2021 coverage joined by Roberto Bondesan, an AI Researcher at Qualcomm. 

In our conversation with Roberto, we explore his paper Probabilistic Numeric Convolutional Neural Networks, which represents features as Gaussian processes, providing a probabilistic description of discretization error. We discuss some of the other work the team at Qualcomm presented at the conference, including a paper called Adaptive Neural Compression, as well as work on Guage Equvariant Mesh CNNs. Finally, we briefly discuss quantum deep learning, and what excites Roberto and his team about the future of their research in combinatorial optimization.  

The complete show notes for this episode can be found at https://twimlai.com/go/482

May 10, 2021
Building a Unified NLP Framework at LinkedIn with Huiji Gao - #481
33:45

Today we’re joined by Huiji Gao, a Senior Engineering Manager of Machine Learning and AI at LinkedIn. 

In our conversation with Huiji, we dig into his interest in building NLP tools and systems, including a recent open-source project called DeText, a framework for generating models for ranking classification and language generation. We explore the motivation behind DeText, the landscape at LinkedIn before and after it was put into use broadly, and the various contexts it’s being used in at the company. We also discuss the relationship between BERT and DeText via LiBERT, a version of BERT that is trained and calibrated on LinkedIn data, the practical use of these tools from an engineering perspective, the approach they’ve taken to optimization, and much more!

The complete show notes for this episode can be found at https://twimlai.com/go/481

May 06, 2021
Dask + Data Science Careers with Jacqueline Nolis - #480
33:36

Today we’re joined by Jacqueline Nolis, Head of Data Science at Saturn Cloud, and co-host of the Build a Career in Data Science Podcast

You might remember Jacqueline from our Advancing Your Data Science Career During the Pandemic panel, where she shared her experience trying to navigate the suddenly hectic data science job market. Now, a year removed from that panel, we explore her book on data science careers, top insights for folks just getting into the field, ways that job seekers should be signaling that they have the required background, and how to approach and navigate failure as a data scientist. 

We also spend quite a bit of time discussing Dask, an open-source library for parallel computing in Python, as well as use cases for the tool, the relationship between dask and Kubernetes and docker containers, where data scientists are in regards to the software development toolchain and much more!

The complete show notes for this episode can be found at https://twimlai.com/go/480.

 

May 03, 2021
Machine Learning for Equitable Healthcare Outcomes with Irene Chen - #479
35:28

Today we’re joined by Irene Chen, a Ph.D. student at MIT. 

Irene’s research is focused on developing new machine learning methods specifically for healthcare, through the lens of questions of equity and inclusion. In our conversation, we explore some of the various projects that Irene has worked on, including an early detection program for intimate partner violence. 

We also discuss how she thinks about the long term implications of predictions in the healthcare domain, how she’s learned to communicate across the interface between the ML researcher and clinician, probabilistic approaches to machine learning for healthcare, and finally, key takeaways for those of you interested in this area of research.

The complete show notes for this episode can be found at https://twimlai.com/go/479.

Apr 29, 2021
AI Storytelling Systems with Mark Riedl - #478
40:06

Today we’re joined by Mark Riedl, a Professor in the School of Interactive Computing at Georgia Tech. In our conversation with Mark, we explore his work building AI storytelling systems, mainly those that try and predict what listeners think will happen next in a story and how he brings together many different threads of ML/AI together to solve these problems. We discuss how the theory of mind is layered into his research, the use of large language models like GPT-3, and his push towards being able to generate suspenseful stories with these systems. 

We also discuss the concept of intentional creativity and the lack of good theory on the subject, the adjacent areas in ML that he’s most excited about for their potential contribution to his research, his recent focus on model explainability, how he approaches problems of common sense, and much more! 

The complete show notes for this episode can be found at https://twimlai.com/go/478.

Apr 26, 2021
Creating Robust Language Representations with Jamie Macbeth - #477
38:42

Today we’re joined by Jamie Macbeth, an assistant professor in the department of computer science at Smith College. 

In our conversation with Jamie, we explore his work at the intersection of cognitive systems and natural language understanding, and how to use AI as a vehicle for better understanding human intelligence. We discuss the tie that binds these domains together, if the tasks are the same as traditional NLU tasks, and what are the specific things he’s trying to gain deeper insights into.

One of the unique aspects of Jamie’s research is that he takes an “old-school AI” approach, and to that end, we discuss the models he handcrafts to generate language. Finally, we examine how he evaluates the performance of his representations if he’s not playing the SOTA “game,” what he bookmarks against, identifying deficiencies in deep learning systems, and the exciting directions for his upcoming research. 

The complete show notes for this episode can be found at https://twimlai.com/go/477.

Apr 21, 2021
Reinforcement Learning for Industrial AI with Pieter Abbeel - #476
56:56

Today we’re joined by Pieter Abbeel, a Professor at UC Berkeley, co-Director of the Berkeley AI Research Lab (BAIR), as well as Co-founder and Chief Scientist at Covariant.

In our conversation with Pieter, we cover a ton of ground, starting with the specific goals and tasks of his work at Covariant, the shift in needs for industrial AI application and robots, if his experience solving real-world problems has changed his opinion on end to end deep learning, and the scope for the three problem domains of the models he’s building.

We also explore his recent work at the intersection of unsupervised and reinforcement learning, goal-directed RL, his recent paper “Pretrained Transformers as Universal Computation Engines” and where that research thread is headed, and of course, his new podcast Robot Brains, which you can find on all streaming platforms today!

The complete show notes for this episode can be found at twimlai.com/go/476.

Apr 19, 2021
AutoML for Natural Language Processing with Abhishek Thakur - #475
34:54

Today we’re joined by Abhishek Thakur, a machine learning engineer at Hugging Face, and the world’s first Quadruple Kaggle Grandmaster!

In our conversation with Abhishek, we explore his Kaggle journey, including how his approach to competitions has evolved over time, what resources he used to prepare for his transition to a full-time practitioner, and the most important lessons he’s learned along the way.

We also spend a great deal of time discussing his new role at HuggingFace, where he's building AutoNLP. We talk through the goals of the project, the primary problem domain, and how the results of AutoNLP compare with those from hand-crafted models. Finally, we discuss Abhishek’s book, Approaching (Almost) Any Machine Learning Problem.

The complete show notes for this episode can be found at https://twimlai.com/go/475.

Apr 15, 2021
Inclusive Design for Seeing AI with Saqib Shaikh - #474
35:21

Today we’re joined by Saqib Shaikh, a Software Engineer at Microsoft, and the lead for the Seeing AI Project.

In our conversation with Saqib, we explore the Seeing AI app, an app “that narrates the world around you.” We discuss the various technologies and use cases for the app, and how it has evolved since the inception of the project, how the technology landscape supports projects like this one, and the technical challenges he faces when building out the app.

We also the relationship and trust between humans and robots, and how that translates to this app, what Saqib sees on the research horizon that will support his vision for the future of Seeing AI, and how the integration of tech like Apple’s upcoming “smart” glasses could change the way their app is used.

The complete show notes for this episode can be found at twimlai.com/go/474.

Apr 12, 2021
Theory of Computation with Jelani Nelson - #473
33:20

Today we’re joined by Jelani Nelson, a professor in the Theory Group at UC Berkeley.

In our conversation with Jelani, we explore his research in computational theory, where he focuses on building streaming and sketching algorithms, random projections, and dimensionality reduction. We discuss how Jelani thinks about the balance between the innovation of new algorithms and the performance of existing ones, and some use cases where we’d see his work in action.

Finally, we talk through how his work ties into machine learning, what tools from the theorist’s toolbox he’d suggest all ML practitioners know, and his nonprofit AddisCoder, a 4 week summer program that introduces high-school students to programming and algorithms.

The complete show notes for this episode can be found at twimlai.com/go/473.

Apr 08, 2021
Human-Centered ML for High-Risk Behaviors with Stevie Chancellor - #472
39:50

Today we’re joined by Stevie Chancellor, an Assistant Professor in the Department of Computer Science and Engineering at the University of Minnesota.

In our conversation with Stevie, we explore her work at the intersection of human-centered computing, machine learning, and high-risk mental illness behaviors. We discuss how her background in HCC helps shapes her perspective, how machine learning helps with understanding severity levels of mental illness, and some recent work where convolutional graph neural networks are applied to identify and discover new kinds of behaviors for people who struggle with opioid use disorder.

We also explore the role of computational linguistics and NLP in her research, issues in using social media data being used as a data source, and finally, how people who are interested in an introduction to human-centered computing can get started.

The complete show notes for this episode can be found at twimlai.com/go/472.

Apr 05, 2021
Operationalizing AI at Dataiku with Conor Jensen - #471
23:38

In this episode, we’re joined by Dataiku’s Director of Data Science, Conor Jensen. In our conversation, we explore the panel he lead at TWIMLcon “AI Operationalization: Where the AI Rubber Hits the Road for the Enterprise,” discussing the ML journey of each panelist’s company, and where Dataiku fits in the equation.

The complete show notes for this episode can be found at https://twimlai.com/go/471

Apr 01, 2021
ML Lifecycle Management at Algorithmia with Diego Oppenheimer - #470
25:11

In this episode, we’re joined by Diego Oppenheimer, Founder and CEO of Algorithmia. In our conversation, we discuss Algorithmia’s involvement with TWIMLcon, as well as an exploration of the results of their recently conducted survey on the state of the AI market.

The complete show notes for this episode can be found at twimlai.com/go/470.

Apr 01, 2021
End to End ML at Cloudera with Santiago Giraldo - #469 [TWIMLcon Sponsor Series]
22:09

In this episode, we’re joined by Santiago Giraldo, Director Of Product Marketing for Data Engineering & Machine Learning at Cloudera. In our conversation, we discuss Cloudera’s talks at TWIMLcon, as well as their various research efforts from their Fast Forward Labs arm.


The complete show notes for this episode can be found at twimlai.com/sponsorseries.

Mar 29, 2021
ML Platforms for Global Scale at Prosus with Paul van der Boor - #468 [TWIMLcon Sponsor Series]
21:50

In this episode, we’re joined by Paul van der Boor, Senior Director of Data Science at Prosus, to discuss his TWIMLcon experience and how they’re using ML platforms to manage machine learning at a global scale.

The complete show notes for this episode can be found at twimlai.com/sponsorseries.

Mar 29, 2021
Can Language Models Be Too Big? 🦜 with Emily Bender and Margaret Mitchell - #467
53:18

Today we’re joined by Emily M. Bender, Professor at the University of Washington, and AI Researcher, Margaret Mitchell. 

Emily and Meg, as well as Timnit Gebru and Angelina McMillan-Major, are co-authors on the paper On the Dangers of Stochastic Parrots: Can Language Models Be Too Big? 🦜. As most of you undoubtedly know by now, there has been much controversy surrounding, and fallout from, this paper. In this conversation, our main priority was to focus on the message of the paper itself. We spend some time discussing the historical context for the paper, then turn to the goals of the paper, discussing the many reasons why the ever-growing datasets and models are not necessarily the direction we should be going. 

We explore the cost of these training datasets, both literal and environmental, as well as the bias implications of these models, and of course the perpetual debate about responsibility when building and deploying ML systems. Finally, we discuss the thin line between AI hype and useful AI systems, and the importance of doing pre-mortems to truly flesh out any issues you could potentially come across prior to building models, and much much more. 

The complete show notes for this episode can be found at twimlai.com/go/467.

Mar 24, 2021
Applying RL to Real-World Robotics with Abhishek Gupta - #466
34:35

Today we’re joined by Abhishek Gupta, a PhD Student at UC Berkeley. 

Abhishek, a member of the BAIR Lab, joined us to talk about his recent robotics and reinforcement learning research and interests, which focus on applying RL to real-world robotics applications. We explore the concept of reward supervision, and how to get robots to learn these reward functions from videos, and the rationale behind supervised experts in these experiments. 

We also discuss the use of simulation for experiments, data collection, and the path to scalable robotic learning. Finally, we discuss gradient surgery vs gradient sledgehammering, and his ecological RL paper, which focuses on the “phenomena that exist in the real world” and how humans and robotics systems interface in those situations. 

The complete show notes for this episode can be found at https://twimlai.com/go/466.

Mar 22, 2021
Accelerating Innovation with AI at Scale with David Carmona - #465
47:29

Today we’re joined by David Carmona, General Manager of Artificial Intelligence & Innovation at Microsoft. 

In our conversation with David, we focus on his work on AI at Scale, an initiative focused on the change in the ways people are developing AI, driven in large part by the emergence of massive models. We explore David’s thoughts about the progression towards larger models, the focus on parameters and how it ties to the architecture of these models, and how we should assess how attention works in these models.

We also discuss the different families of models (generation & representation), the transition from CV to NLP tasks, and an interesting point of models “becoming a platform” via transfer learning.

The complete show notes for this episode can be found at twimlai.com/go/465.

Mar 18, 2021
Complexity and Intelligence with Melanie Mitchell - #464
32:17

Today we’re joined by Melanie Mitchell, Davis Professor at the Santa Fe Institute and author of Artificial Intelligence: A Guide for Thinking Humans. 

While Melanie has had a long career with a myriad of research interests, we focus on a few, complex systems and the understanding of intelligence, complexity, and her recent work on getting AI systems to make analogies. We explore examples of social learning, and how it applies to AI contextually, and defining intelligence. 

We discuss potential frameworks that would help machines understand analogies, established benchmarks for analogy, and if there is a social learning solution to help machines figure out analogy. Finally we talk through the overall state of AI systems, the progress we’ve made amid the limited concept of social learning, if we’re able to achieve intelligence with current approaches to AI, and much more!

The complete show notes for this episode can be found at twimlai.com/go/464.

Mar 15, 2021
Robust Visual Reasoning with Adriana Kovashka - #463
40:05

Today we’re joined by Adriana Kovashka, an Assistant Professor at the University of Pittsburgh.

In our conversation with Adriana, we explore her visual commonsense research, and how it intersects with her background in media studies. We discuss the idea of shortcuts, or faults in visual question answering data sets that appear in many SOTA results, as well as the concept of masking, a technique developed to assist in context prediction. Adriana then describes how these techniques fit into her broader goal of trying to understand the rhetoric of visual advertisements. 

Finally, Adriana shares a bit about her work on robust visual reasoning, the parallels between this research and other work happening around explainability, and the vision for her work going forward. 

The complete show notes for this episode can be found at twimlai.com/go/463.

Mar 11, 2021
Architectural and Organizational Patterns in Machine Learning with Nishan Subedi - #462
57:01

Today we’re joined by Nishan Subedi, VP of Algorithms at Overstock.com.

In our conversation with Nishan, we discuss his interesting path to MLOps and how ML/AI is used at Overstock, primarily for search/recommendations and marketing/advertisement use cases. We spend a great deal of time exploring machine learning architecture and architectural patterns, how he perceives the differences between architectural patterns and algorithms, and emergent architectural patterns that standards have not yet been set for.

Finally, we discuss how the idea of anti-patterns was innovative in early design pattern thinking and if those concepts are transferable to ML, if architectural patterns will bleed over into organizational patterns and culture, and Nishan introduces us to the concept of Squads within an organizational structure.

The complete show notes for this episode can be found at https://twimlai.com/go/462.

Mar 08, 2021
Common Sense Reasoning in NLP with Vered Shwartz - #461
35:39

Today we’re joined by Vered Shwartz, a Postdoctoral Researcher at both the Allen Institute for AI and the Paul G. Allen School of Computer Science & Engineering at the University of Washington.

In our conversation with Vered, we explore her NLP research, where she focuses on teaching machines common sense reasoning in natural language. We discuss training using GPT models and the potential use of multimodal reasoning and incorporating images to augment the reasoning capabilities.

Finally, we talk through some other noteworthy research in this field, how she deals with biases in the models, and Vered's future plans for incorporating some of the newer techniques into her future research.

The complete show notes for this episode can be found at https://twimlai.com/go/461

Mar 04, 2021
How to Be Human in the Age of AI with Ayanna Howard - #460

Today we’re joined by returning guest and newly appointed Dean of the College of Engineering at The Ohio State University, Ayanna Howard. 

Our conversation with Dr. Howard focuses on her recently released book, Sex, Race, and Robots: How to Be Human in the Age of AI, which is an extension of her research on the relationships between humans and robots. We continue to explore this relationship through the themes of socialization introduced in the book, like associating genders to AI and robotic systems and the “self-fulfilling prophecy” that has become search engines. 

We also discuss a recurring conversation in the community around AI  being biased because of data versus models and data, and the choices and responsibilities that come with the ethical aspects of building AI systems. Finally, we discuss Dr. Howard’s new role at OSU, how it will affect her research, and what the future holds for the applied AI field. 

The complete show notes for this episode can be found at https://twimlai.com/go/460.

Mar 01, 2021
How to Be Human in the Age of AI with Ayanna Howard - #460
34:57

Today we’re joined by returning guest and newly appointed Dean of the College of Engineering at The Ohio State University, Ayanna Howard. 

Our conversation with Dr. Howard focuses on her recently released book, Sex, Race, and Robots: How to Be Human in the Age of AI, which is an extension of her research on the relationships between humans and robots. We continue to explore this relationship through the themes of socialization introduced in the book, like associating genders to AI and robotic systems and the “self-fulfilling prophecy” that has become search engines. 

We also discuss a recurring conversation in the community around AI  being biased because of data versus models and data, and the choices and responsibilities that come with the ethical aspects of building AI systems. Finally, we discuss Dr. Howard’s new role at OSU, how it will affect her research, and what the future holds for the applied AI field. 

The complete show notes for this episode can be found at https://twimlai.com/go/460.

Mar 01, 2021
Evolution and Intelligence with Penousal Machado - #459
57:00

Today we’re joined by Penousal Machado, Associate Professor and Head of the Computational Design and Visualization Lab in the Center for Informatics at the University of Coimbra. 

In our conversation with Penousal, we explore his research in Evolutionary Computation, and how that work coincides with his passion for images and graphics. We also discuss the link between creativity and humanity, and have an interesting sidebar about the philosophy of Sci-Fi in popular culture. 

Finally, we dig into Penousals evolutionary machine learning research, primarily in the context of the evolution of various animal species mating habits and practices.

The complete show notes for this episode can be found at twimlai.com/go/459.  

Feb 25, 2021
Innovating Neural Machine Translation with Arul Menezes - #458
42:33

Today we’re joined by Arul Menezes, a Distinguished Engineer at Microsoft. 

Arul, a 30 year veteran of Microsoft, manages the machine translation research and products in the Azure Cognitive Services group. In our conversation, we explore the historical evolution of machine translation like breakthroughs in seq2seq and the emergence of transformer models. 

We also discuss how they’re using multilingual transfer learning and combining what they’ve learned in translation with pre-trained language models like BERT. Finally, we explore what they’re doing to experience domain-specific improvements in their models, and what excites Arul about the translation architecture going forward. 

The complete show notes for this series can be found at twimlai.com/go/458.

Feb 22, 2021
Building the Product Knowledge Graph at Amazon with Luna Dong - #457
43:32

Today we’re joined by Luna Dong, Sr. Principal Scientist at Amazon.

In our conversation with Luna, we explore Amazon’s expansive product knowledge graph, and the various roles that machine learning plays throughout it. We also talk through the differences and synergies between the media and retail product knowledge graph use cases and how ML comes into play in search and recommendation use cases. Finally, we explore the similarities to relational databases and efforts to standardize the product knowledge graphs across the company and broadly in the research community.

The complete show notes for this episode can be found at https://twimlai.com/go/457.

Feb 18, 2021
Towards a Systems-Level Approach to Fair ML with Sarah M. Brown - #456
37:22

Today we’re joined by Sarah Brown, an Assistant Professor of Computer Science at the University of Rhode Island.

In our conversation with Sarah, whose research focuses on Fairness in AI, we discuss why a “systems-level” approach is necessary when thinking about ethical and fairness issues in models and algorithms. We also explore Wiggum: a fairness forensics tool, which explores bias and allows for regular auditing of data, as well as her ongoing collaboration with a social psychologist to explore how people perceive ethics and fairness.

Finally, we talk through the role of tools in assessing fairness and bias, and the importance of understanding the decisions the tools are making.

The complete show notes can be found at twimlai.com/go/456.

Feb 15, 2021
AI for Digital Health Innovation with Andrew Trister - #455
42:12

Today we’re joined by Andrew Trister, Deputy Director for Digital Health Innovation at the Bill & Melinda Gates Foundation. 

In our conversation with Andrew, we explore some of the AI use cases at the foundation, with the goal of bringing “community-based” healthcare to underserved populations in the global south. We focus on COVID-19 response and improving the accuracy of malaria testing with a bayesian framework and a few others, and the challenges like scaling these systems and building out infrastructure so that communities can begin to support themselves. 

We also touch on Andrew's previous work at Apple, where he helped develop what is now known as Research Kit, their ML for health tools that are now seen in apple devices like phones and watches.

The complete show notes for this episode can be found at https://twimlai.com/go/455

Feb 11, 2021
System Design for Autonomous Vehicles with Drago Anguelov - #454
50:39

Today we’re joined by Drago Anguelov, Distinguished Scientist and Head of Research at Waymo. 

In our conversation, we explore the state of the autonomous vehicles space broadly and at Waymo, including how AV has improved in the last few years, their focus on level 4 driving, and Drago’s thoughts on the direction of the industry going forward. Drago breaks down their core ML use cases, Perception, Prediction, Planning, and Simulation, and how their work has lead to a fully autonomous vehicle being deployed in Phoenix. 

We also discuss the socioeconomic and environmental impact of self-driving cars, a few research papers submitted to NeurIPS 2020, and if the sophistication of AV systems will lend themselves to the development of tomorrow’s enterprise machine learning systems.

The complete show notes for this episode can be found at twimlai.com/go/454

Feb 08, 2021
Building, Adopting, and Maturing LinkedIn's Machine Learning Platform with Ya Xu - #453
49:05

Today we’re joined by Ya Xu, head of Data Science at LinkedIn, and TWIMLcon: AI Platforms 2021 Keynote Speaker.

We cover a ton of ground with Ya, starting with her experiences prior to becoming Head of DS, as one of the architects of the LinkedIn Platform. We discuss her “three phases” (building, adoption, and maturation) to keep in mind when building out a platform, how to avoid “hero syndrome” early in the process.

Finally, we dig into the various tools and platforms that give LinkedIn teams leverage, their organizational structure, as well as the emergence of differential privacy for security use cases and if it's ready for prime time.

The complete show notes for this episode can be found at https://twimlai.com/go/453

Feb 04, 2021
Expressive Deep Learning with Magenta DDSP w/ Jesse Engel - #452
38:54

Today we’re joined by Jesse Engel, Staff Research Scientist at Google, working on the Magenta Project. 

In our conversation with Jesse, we explore the current landscape of creativity AI, and the role Magenta plays in helping express creativity through ML and deep learning. We dig deep into their Differentiable Digital Signal Processing (DDSP) library, which “lets you combine the interpretable structure of classical DSP elements (such as filters, oscillators, reverberation, etc.) with the expressivity of deep learning.”

Finally, Jesse walks us through some of the other projects that the Magenta team undertakes, including NLP and language modeling, and what he wants to see come out of the work that he and others are doing in creative AI research.

The complete show notes for this episode can be found at twimlai.com/go/452

Feb 01, 2021
Semantic Folding for Natural Language Understanding with Francisco Weber - #451
55:10

Today we’re joined by return guest Francisco Webber, CEO & Co-founder of Cortical.io.

Francisco was originally a guest over 4 years and 400 episodes ago, where we discussed his company Cortical.io, and their unique approach to natural language processing. In this conversation, Francisco gives us an update on Cortical, including their applications and toolkit, including semantic extraction, classifier, and search use cases. We also discuss GPT-3, and how it compares to semantic folding, the unreasonable amount of data needed to train these models, and the difference between the GPT approach and semantic modeling for language understanding.

The complete show notes for this episode can be found at twimlai.com/go/451.

Jan 29, 2021
The Future of Autonomous Systems with Gurdeep Pall - #450
52:50

Today we’re joined by Gurdeep Pall, Corporate Vice President at Microsoft.

Gurdeep, who we had the pleasure of speaking with on his 31st anniversary at the company, has had a hand in creating quite a few influential projects, including Skype for business (and Teams) and being apart of the first team that shipped wifi as a part of a general-purpose operating system.

In our conversation with Gurdeep, we discuss Microsoft’s acquisition of Bonsai and how they fit in the toolchain for creating brains for autonomous systems with “machine teaching,” and other practical applications of machine teaching in autonomous systems. We also explore the challenges of simulation, and how they’ve evolved to make the problems that the physical world brings more tenable. Finally, Gurdeep shares concrete use cases for autonomous systems, and how to get the best ROI on those investments, and of course, what’s next in the very broad space of autonomous systems.

The complete show notes for this episode can be found at twimlai.com/go/450.

Jan 25, 2021
AI for Ecology and Ecosystem Preservation with Bryan Carstens - #449
35:57

Today we’re joined by Bryan Carstens, a professor in the Department of Evolution, Ecology, and Organismal Biology & Head of the Tetrapod Division in the Museum of Biological Diversity at The Ohio State University.

In our conversation with Bryan, who comes from a traditional biology background, we cover a ton of ground, including a foundational layer of understanding for the vast known unknowns in species and biodiversity, and how he came to apply machine learning to his lab’s research.

We explore a few of his lab’s projects, including applying ML to genetic data to understand the geographic and environmental structure of DNA, what factors keep machine learning from being used more frequently used in biology, and what’s next for his group.

The complete show notes for this episode can be found at twimlai.com/go/449.

Jan 21, 2021
Off-Line, Off-Policy RL for Real-World Decision Making at Facebook - #448
01:00:59

Today we’re joined by Jason Gauci, a Software Engineering Manager at Facebook AI.

In our conversation with Jason, we explore their Reinforcement Learning platform, Re-Agent (Horizon). We discuss the role of decision making and game theory in the platform and the types of decisions they’re using Re-Agent to make, from ranking and recommendations to their eCommerce marketplace.

Jason also walks us through the differences between online/offline and on/off policy model training, and where Re-Agent sits in this spectrum. Finally, we discuss the concept of counterfactual causality, and how they ensure safety in the results of their models.

The complete show notes for this episode can be found at twimlai.com/go/448.

Jan 18, 2021
A Future of Work for the Invisible Workers in A.I. with Saiph Savage - #447
39:56

Today we’re joined by Saiph Savage, a Visiting professor at the Human-Computer Interaction Institute at CMU, director of the HCI Lab at WVU, and co-director of the Civic Innovation Lab at UNAM.

We caught up with Saiph during NeurIPS where she delivered an insightful invited talk “A Future of Work for the Invisible Workers in A.I.”. In our conversation with Saiph, we gain a better understanding of the “Invisible workers,” or the people doing the work of labeling for machine learning and AI systems, and some of the issues around lack of economic empowerment, emotional trauma, and other issues that arise with these jobs.

We discuss ways that we can empower these workers, and push the companies that are employing these workers to do the same. Finally, we discuss Saiph’s participatory design work with rural workers in the global south.

The complete show notes for this episode can be found at twimlai.com/go/447.

Jan 14, 2021
Trends in Graph Machine Learning with Michael Bronstein - #446
01:10:52

Today we’re back with the final episode of AI Rewind joined by Michael Bronstein, a professor at Imperial College London and the Head of Graph Machine Learning at Twitter.

In our conversation with Michael, we touch on his thoughts about the year in Machine Learning overall, including GPT-3 and Implicit Neural Representations, but spend a major chunk of time on the sub-field of Graph Machine Learning. 

We talk through the application of Graph ML across domains like physics and bioinformatics, and the tools to look out for. Finally, we discuss what Michael thinks is in store for 2021, including graph ml applied to molecule discovery and non-human communication translation.

Jan 11, 2021
Trends in Natural Language Processing with Sameer Singh - #445
01:22:51

Today we continue the 2020 AI Rewind series, joined by friend of the show Sameer Singh, an Assistant Professor in the Department of Computer Science at UC Irvine. 

We last spoke with Sameer at our Natural Language Processing office hours back at TWIMLfest, and was the perfect person to help us break down 2020 in NLP. Sameer tackles the review in 4 main categories, Massive Language Modeling, Fundamental Problems with Language Models, Practical Vulnerabilities with Language Models, and Evaluation. 

We also explore the impact of GPT-3 and Transformer models, the intersection of vision and language models, and the injection of causal thinking and modeling into language models, and much more.

The complete show notes for this episode can be found at twimlai.com/go/445.

Jan 07, 2021
Trends in Computer Vision with Pavan Turaga - #444
01:10:29

AI Rewind continues today as we’re joined by Pavan Turaga, Associate Professor in both the Departments of Arts, Media, and Engineering & Electrical Engineering, and the Interim Director of the School of Arts, Media, and Engineering at Arizona State University.

Pavan, who joined us back in June to talk through his work from CVPR ‘20, Invariance, Geometry and Deep Neural Networks, is back to walk us through the trends he’s seen in Computer Vision last year. We explore the revival of physics-based thinking about scenes, differential rendering, the best papers, and where the field is going in the near future.

We want to hear from you! Send your thoughts on the year that was 2020 below in the comments, or via Twitter at @samcharrington or @twimlai.

The complete show notes for this episode can be found at twimlai.com/go/444

Jan 04, 2021
Trends in Reinforcement Learning with Pablo Samuel Castro - #443
01:26:38

Today we kick off our annual AI Rewind series joined by friend of the show Pablo Samuel Castro, a Staff Research Software Developer at Google Brain.

Pablo joined us earlier this year for a discussion about Music & AI, and his Geometric Perspective on Reinforcement Learning, as well our RL office hours during the inaugural TWIMLfest. In today’s conversation, we explore some of the latest and greatest RL advancements coming out of the major conferences this year, broken down into a few major themes, Metrics/Representations, Understanding and Evaluating Deep Reinforcement Learning, and RL in the Real World.

This was a very fun conversation, and we encourage you to check out all the great papers and other resources available on the show notes page.

Dec 30, 2020
MOReL: Model-Based Offline Reinforcement Learning with Aravind Rajeswaran - #442
37:05

Today we close out our NeurIPS series joined by Aravind Rajeswaran, a PhD Student in machine learning and robotics at the University of Washington.

At NeurIPS, Aravind presented his paper MOReL: Model-Based Offline Reinforcement Learning. In our conversation, we explore model-based reinforcement learning, and if models are a “prerequisite” to achieve something analogous to transfer learning. We also dig into MOReL and the recent progress in offline reinforcement learning, the differences in developing MOReL models and traditional RL models, and the theoretical results they’re seeing from this research.

The complete show notes for this episode can be found at twimlai.com/go/442

Dec 28, 2020
Machine Learning as a Software Engineering Enterprise with Charles Isbell - #441
45:22

As we continue our NeurIPS 2020 series, we’re joined by friend-of-the-show Charles Isbell, Dean, John P. Imlay, Jr. Chair, and professor at the Georgia Tech College of Computing.

This year Charles gave an Invited Talk at this year’s conference, You Can’t Escape Hyperparameters and Latent Variables: Machine Learning as a Software Engineering Enterprise. In our conversation, we explore the success of the Georgia Tech Online Masters program in CS, which now has over 11k students enrolled, and the importance of making the education accessible to as many people as possible. We spend quite a bit speaking about the impact machine learning is beginning to have on the world, and how we should move from thinking of ourselves as compiler hackers, and begin to see the possibilities and opportunities that have been ignored.

We also touch on the fallout from Timnit Gebru being “resignated” and the importance of having diverse voices and different perspectives “in the room,” and what the future holds for machine learning as a discipline.

The complete show notes for this episode can be found at twimlai.com/go/441

Dec 23, 2020
Natural Graph Networks with Taco Cohen - #440
57:25

Today we kick off our NeurIPS 2020 series joined by Taco Cohen, a Machine Learning Researcher at Qualcomm Technologies.

In our conversation with Taco, we discuss his current research in equivariant networks and video compression using generative models, as well as his paper “Natural Graph Networks,” which explores the concept of “naturality, a generalization of equivariance” which suggests that weaker constraints will allow for a “wider class of architectures.”

We also discuss some of Taco’s recent research on neural compression and a very interesting visual demo for equivariance CNNs that Taco and the Qualcomm team released during the conference.

The complete show notes for this episode can be found at twimlai.com/go/440.

Dec 21, 2020
Productionizing Time-Series Workloads at Siemens Energy with Edgar Bahilo Rodriguez - #439
42:22

Today we close out our re:Invent series joined by Edgar Bahilo Rodriguez, Lead Data Scientist in the industrial applications division of Siemens Energy.

Edgar spoke at this year's re:Invent conference about Productionizing R Workloads, and the resurrection of R for machine learning and productionalization. In our conversation with Edgar, we explore the fundamentals of building a strong machine learning infrastructure, and how they’re breaking down applications and using mixed technologies to build models.

We also discuss their industrial applications, including wind, power production management, managing systems intent on decreasing the environmental impact of pre-existing installations, and their extensive use of time-series forecasting across these use cases.

The complete show notes can be found at twimlai.com/go/439.

Dec 18, 2020
ML Feature Store at Intuit with Srivathsan Canchi - #438
41:59

Today we continue our re:Invent series with Srivathsan Canchi, Head of Engineering for the Machine Learning Platform team at Intuit. 

As we teased earlier this week, one of the major announcements coming from AWS at re:Invent was the release of the SageMaker Feature Store. To our pleasant surprise, we came to learn that our friends at Intuit are the original architects of this offering and partnered with AWS to productize it at a much broader scale. In our conversation with Srivathsan, we explore the focus areas that are supported by the Intuit machine learning platform across various teams, including QuickBooks and Mint, Turbotax, and Credit Karma,  and his thoughts on why companies should be investing in feature stores. 

We also discuss why the concept of “feature store” has seemingly exploded in the last year, and how you know when your organization is ready to deploy one. Finally, we dig into the specifics of the feature store, including the popularity of graphQL and why they chose to include it in their pipelines, the similarities (and differences) between the two versions of the store, and much more!

The complete show notes for this episode can be found at twimlai.com/go/438.

Dec 16, 2020
re:Invent Roundup 2020 with Swami Sivasubramanian - #437
49:39

Today we’re kicking off our annual re:invent series joined by Swami Sivasubramanian, VP of Artificial Intelligence, at AWS.

During re:Invent last week, Amazon made a ton of announcements on the machine learning front, including quite a few advancements to SageMaker. In this roundup conversation, we discuss the motivation for hosting the first-ever machine learning keynote at the conference, a bunch of details surrounding tools like Pipelines for workflow management, Clarify for bias detection, and JumpStart for easy to use algorithms and notebooks, and many more.

We also discuss the emphasis placed on DevOps and MLOps tools in these announcements, and how the tools are all interconnected. Finally, we briefly touch on the announcement of the AWS feature store, but be sure to check back later this week for a more in-depth discussion on that particular release!

The complete show notes for this episode can be found at twimlai.com/go/437.

Dec 14, 2020
Predictive Disease Risk Modeling at 23andMe with Subarna Sinha - #436
39:52

Today we’re joined by Subarna Sinha, Machine Learning Engineering Leader at 23andMe.

23andMe handles a massive amount of genomic data every year from its core ancestry business but also uses that data for disease prediction, which is the core use case we discuss in our conversation.

Subarna talks us through an initial use case of creating an evaluation of polygenic scores, and how that led them to build an ML pipeline and platform. We talk through the tools and tech stack used for the operationalization of their platform, the use of synthetic data, the internal pushback that came along with the changes that were being made, and what’s next for her team and the platform.

The complete show notes for this episode can be found at twimlai.com/go/436.

Dec 11, 2020
Scaling Video AI at RTL with Daan Odijk - #435
40:36

Today we’re joined by Daan Odijk, Data Science Manager at RTL.

In our conversation with Daan, we explore the RTL MLOps journey, and their need to put platform infrastructure in place for ad optimization and forecasting, personalization, and content understanding use cases. Daan walks us through some of the challenges on both the modeling and engineering sides of building the platform, as well as the inherent challenges of video applications.

Finally, we discuss the current state of their platform, and the benefits they’ve seen from having this infrastructure in place, and why using building a custom platform was worth the investment.

The complete show notes for this episode can be found at twimlai.com/go/435. 

Dec 09, 2020
Benchmarking ML with MLCommons w/ Peter Mattson - #434
46:13

Today we’re joined by Peter Mattson, General Chair at MLPerf, a Staff Engineer at Google, and President of MLCommons. 

In our conversation with Peter, we discuss MLCommons and MLPerf, the former an open engineering group with the goal of accelerating machine learning innovation, and the latter a set of standardized Machine Learning speed benchmarks used to measure things like model training speed, throughput speed for inference. 

We explore the target user for the MLPerf benchmarks, the need for benchmarks in the ethics, bias, fairness space, and how they’re approaching this through the "People’s Speech" datasets. We also walk through the MLCommons best practices of getting a model into production, why it's so difficult, and how MLCube can make the process easier for researchers and developers.

The complete show notes page for this episode can be found at twimlai.com/go/434.

Dec 07, 2020
Deep Learning for NLP: From the Trenches with Charlene Chambliss - #433
46:21

Today we’re joined by Charlene Chambliss, Machine Learning Engineer at Primer AI. 

Charlene, who we also had the pleasure of hosting at NLP Office Hours during TWIMLfest, is back to share some of the work she’s been doing with NLP. In our conversation, we explore her experiences working with newer NLP models and tools like BERT and HuggingFace, as well as whats she’s learned along the way with word embeddings, labeling tasks, debugging, and more. We also focus on a few of her projects, like her popular multi-lingual BERT project, and a COVID-19 classifier. 

Finally, Charlene shares her experience getting into data science and machine learning coming from a non-technical background, and what the transition was like, and tips for people looking to make a similar shift.

Dec 03, 2020
Feature Stores for Accelerating AI Development - #432
57:00

In this special episode of the podcast, we're joined by Kevin Stumpf, Co-Founder and CTO of Tecton, Willem Pienaar, an engineering lead at Gojek and founder of the Feast Project, and Maxime Beauchemin, Founder & CEO of Preset, for a discussion on Feature Stores for Accelerating AI Development.

In this panel discussion, Sam and our guests explored how organizations can increase value and decrease time-to-market for machine learning using feature stores, MLOps, and open source. We also discuss the main data challenges of AI/ML, and the role of the feature store in solving those challenges.

The complete show notes for this episode can be found at twimlai.com/go/432.

Nov 30, 2020
An Exploration of Coded Bias with Shalini Kantayya, Deb Raji and Meredith Broussard - #431
01:25:10

In this special edition of the podcast, we're joined by Shalini Kantayya, the director of Coded Bias, and Deb Raji and Meredith Broussard, who both contributed to the film.

In this panel discussion, Sam and our guests explored the societal implications of the biases embedded within AI algorithms. The conversation discussed examples of AI systems with disparate impact across industries and communities, what can be done to mitigate this disparity, and opportunities to get involved.

Our panelists Shalini, Meredith, and Deb each share insight into their experience working on and researching bias in AI systems and the oppressive and dehumanizing impact they can have on people in the real world.


The complete show notes for this film can be found at twimlai.com/go/431

Nov 27, 2020
Common Sense as an Algorithmic Framework with Dileep George - #430
49:01

Today we’re joined by Dileep George, Founder and the CTO of Vicarious.

Dileep, who was also a co-founder of Numenta, works at the intersection of AI research and neuroscience, and famously pioneered the hierarchical temporal memory. In our conversation, we explore the importance of mimicking the brain when looking to achieve artificial general intelligence, the nuance of “language understanding” and how all the tasks that fall underneath it are all interconnected, with or without language.

We also discuss his work with Recursive Cortical Networks, Schema Networks, and what’s next on the path towards AGI!

Nov 23, 2020
Scaling Enterprise ML in 2020: Still Hard! with Sushil Thomas - #429
48:26

Today we’re joined by Sushil Thomas, VP of Engineering for Machine Learning at Cloudera.

Over the summer, I had the pleasure of hosting Sushil and a handful of business leaders across industries at the Cloudera Virtual Roundtable. In this conversation with Sushil, we recap the roundtable, exploring some of the topics discussed and insights gained from those conversations. Sushil gives us a look at how COVID19 has impacted business throughout the year, and how the pandemic is shaping enterprise decision making moving forward. 

We also discuss some of the key trends he’s seeing as organizations try to scale their machine learning and AI efforts, including understanding best practices, and learning how to hybridize the engineering side of ML with the scientific exploration of the tasks. Finally, we explore if organizational models like hub vs centralized are still organization-specific or if that’s changed in recent years, as well as how to get and retain good ML talent with giant companies like Google and Microsoft looming large.

The complete show notes for this episode can be found at https://twimlai.com/go/429.

Nov 19, 2020
Enabling Clinical Automation: From Research to Deployment with Devin Singh - #428
43:46

Today we’re joined by Devin Singh, a Physician Lead for Clinical Artificial Intelligence & Machine Learning in Pediatric Emergency Medicine at the Hospital for Sick Children (SickKids) in Toronto, and Founder and CEO of HeroAI.

In our conversation with Devin, we discuss some of the interesting ways that Devin is deploying machine learning within the SickKids hospital, the current structure of academic research, including how much research and publications are currently being incentivized, how little of those research projects actually make it to deployment, and how Devin is working to flip that system on it's head. 

We also talk about his work at Hero AI, where he is commercializing and deploying his academic research to build out infrastructure and deploy AI solutions within hospitals, creating an automated pipeline with patients, caregivers, and EHS companies. Finally, we discuss Devins's thoughts on how he’d approach bias mitigation in these systems, and the importance of having proper stakeholder engagement and using design methodology when building ML systems.

The complete show notes for this episode can be found at twimlai.com/go/428.

Nov 16, 2020
Pixels to Concepts with Backpropagation w/ Roland Memisevic - #427
35:33

Today we’re joined by Roland Memisevic, return podcast guest and Co-Founder & CEO of Twenty Billion Neurons. 

We last spoke to Roland in 2018, and just earlier this year TwentyBN made a sharp pivot to a surprising use case, a companion app called Fitness Ally, an interactive, personalized fitness coach on your phone. 

In our conversation with Roland, we explore the progress TwentyBN has made on their goal of training deep neural networks to understand physical movement and exercise. We also discuss how they’ve taken their research on understanding video context and awareness and applied it in their app, including how recent advancements have allowed them to deploy their neural net locally while preserving privacy, and Roland’s thoughts on the enormous opportunity that lies in the merging of language and video processing.

The complete show notes for this episode can be found at twimlai.com/go/427.

Nov 12, 2020
Fighting Global Health Disparities with AI w/ Jon Wang - #426
36:03

Today we’re joined by Jon Wang, a medical student at UCSF, and former Gates Scholar and AI researcher at the Bill and Melinda Gates Foundation.

In our conversation with Jon, we explore a few of the different ways he’s attacking various public health issues, including improving the electronic health records system through automating clinical order sets, and exploring how the lack of literature and AI talent in the non-profit and healthcare spaces, and bad data have further marginalized undersupported communities.

We also discuss his work at the Gates Foundation, which included understanding how AI can be helpful in lower-resource and lower-income countries, and building digital infrastructure, and much more.

The complete show notes for this episode can be found at twimlai.com/go/426.

 

Nov 09, 2020
Accessibility and Computer Vision - #425
01:01:03

Digital imagery is pervasive today. More than a billion images per day are produced and uploaded to social media sites, with many more embedded within websites, apps, digital documents, and eBooks. Engaging with digital imagery has become fundamental to participating in contemporary society, including education, the professions, e-commerce, civics, entertainment, and social interactions.

However, most digital images remain inaccessible to the 39 million people worldwide who are blind. AI and computer vision technologies hold the potential to increase image accessibility for people who are blind, through technologies like automated image descriptions.

The speakers share their perspectives as people who are both technology experts and are blind, providing insight into future directions for the field of computer vision for describing images and videos for people who are blind.

To check out the video of this panel, visit here!

The complete show notes for this episode can be found at twimlai.com/go/425

Nov 05, 2020
NLP for Equity Investing with Frank Zhao - #424
45:43

Today we’re joined by Frank Zhao, Senior Director of Quantamental Research at S&P Global Market Intelligence.

In our conversation with Frank, we explore how he came to work at the intersection of ML and finance, and how he navigates the relationship between data science and domain expertise. We also discuss the rise of data science in the investment management space, examining the largely under-explored technique of using unstructured data to gain insights into equity investing, and the edge it can provide for investors.

Finally, Frank gives us a look at how he uses natural language processing with textual data of earnings call transcripts and walks us through the entire pipeline.

The complete show notes for this episode can be found at twimlai.com/go/424.

Nov 02, 2020
The Future of Education and AI with Salman Khan - #423
47:33

In the final #TWIMLfest Keynote Interview, we’re joined by Salman Khan, Founder of Khan Academy.

In our conversation with Sal, we explore the amazing origin story of the academy, and how coronavirus is shaping the future of education and remote and distance learning, for better and for worse. We also explore Sal’s perspective on machine learning and AI being used broadly in education, the potential of injecting a platform like Khan Academy with ML and AI for course recommendations, and if they’re planning on implementing these features in the future.

Finally, Sal shares some great stories about the impact of community and opportunity, and what advice he has for learners within the TWIML community and beyond!

The complete show notes for this episode can be found at twimlai.com/go/423.

Oct 28, 2020
Why AI Innovation and Social Impact Go Hand in Hand with Milind Tambe - #422
34:06

In this special #TWIMLfest Keynote episode, we’re joined by Milind Tambe, Director of AI for Social Good at Google Research India, and Director of the Center for Research in Computation and Society (CRCS) at Harvard University.

In our conversation, we explore Milind’s various research interests, most of which fall under the umbrella of AI for Social Impact, including his work in public health, both stateside and abroad, his conservation work in South Asia and Africa, and his thoughts on the ways that those interested in social impact can get involved. 

The complete show notes for this episode can be found at twimlai.com/go/422.

Oct 23, 2020
What's Next for Fast.ai? w/ Jeremy Howard - #421
57:44

In this special #TWIMLfest episode of the podcast, we’re joined by Jeremy Howard, Founder of Fast.ai.

In our conversation with Jeremy, we discuss his career path, including his journey through the consulting world and how those experiences led him down the path to ML education, his thoughts on the current state of the machine learning adoption cycle, and if we’re at maximum capacity for deep learning use and capability.

Of course, we dig into the newest version of the fast.ai framework and course, the reception of Jeremy’s book ‘Deep Learning for Coders with Fastai and PyTorch: AI Applications Without a PhD,’ and what’s missing from the machine learning education landscape. If you’ve missed our previous conversations with Jeremy, I encourage you to check them out here and here.

The complete show notes for this episode can be found at https://twimlai.com/go/421.

Oct 21, 2020
Feature Stores for MLOps with Mike del Balso - #420
46:39

Today we’re joined by Mike del Balso, co-Founder and CEO of Tecton. 

Mike, who you might remember from our last conversation on the podcast, was a foundational member of the Uber team that created their ML platform, Michelangelo. Since his departure from the company in 2018, he has been busy building up Tecton, and their enterprise feature store. 

In our conversation, Mike walks us through why he chose to focus on the feature store aspects of the machine learning platform, the journey, personal and otherwise, to operationalizing machine learning, and the capabilities that more mature platforms teams tend to look for or need to build. We also explore the differences between standalone components and feature stores, if organizations are taking their existing databases and building feature stores with them, and what a dynamic, always available feature store looks like in deployment. 

Finally, we explore what sets Tecton apart from other vendors in this space, including enterprise cloud providers who are throwing their hat in the ring.

The complete show notes for this episode can be found at twimlai.com/go/420.

Thanks to our friends at Tecton for sponsoring this episode of the podcast! Find out more about what they're up to at tecton.ai.

Oct 19, 2020
Exploring Causality and Community with Suzana Ilić - #419
52:05

In this special #TWIMLfest episode, we’re joined by Suzana Ilić, a computational linguist at Causaly and founder of Machine Learning Tokyo (MLT).

Suzana joined us as a keynote speaker to discuss the origins of the MLT community, but we cover a lot of ground in this conversation. We briefly discuss Suzana’s work at Causaly, touching on her experiences transitioning from linguist and domain expert to working with causal modeling, balancing her role as both product manager and leader of the development team for their causality extraction module, and the unique ways that she thinks about UI in relation to their product.

We also spend quite a bit of time exploring MLT, including how they’ve achieved exponential growth within the community over the past few years and when Suzana knew MLT was moving beyond just a personal endeavor, her experiences publishing papers at major ML conferences as an independent organization, and inspires her within the broader ML/AI Community. And of course, we answer quite a few great questions from our live audience!

Oct 16, 2020
Decolonizing AI with Shakir Mohamed - #418
52:08

In this special #TWIMLfest edition of the podcast, we’re joined by Shakir Mohamed, a Senior Research Scientist at DeepMind.

Shakir is also a leader of Deep Learning Indaba, a non-profit organization whose mission is to Strengthen African Machine Learning and Artificial Intelligence. In our conversation with Shakir, we discuss his recent paper ‘Decolonial AI,’ the distinction between decolonizing AI and ethical AI, while also exploring the origin of the Indaba, the phases of community, and much more.

The complete show notes for this episode can be found at twimlai.com/go/418.

Oct 14, 2020
Spatial Analysis for Real-Time Video Processing with Adina Trufinescu
40:57

Today we’re joined by Adina Trufinescu, Principal Program Manager at Microsoft, to discuss some of the computer vision updates announced at Ignite 2020. 

We focus on the technical innovations that went into their recently announced spatial analysis software, and the software’s use cases including the movement of people within spaces, distance measurements (social distancing), and more. 

We also discuss the ‘responsible AI guidelines’ put in place to curb bad actors potentially using this software for surveillance, what techniques are being used to do object detection and image classification, and the challenges to productizing this research. 

The complete show notes for this episode can be found at twimlai.com/go/417.

Oct 08, 2020
How Deep Learning has Revolutionized OCR with Cha Zhang - #416
58:41

Today we’re joined by Cha Zhang, a Partner Engineering Manager at Microsoft Cloud & AI. 

Cha’s work at MSFT is focused on exploring ways that new technologies can be applied to optical character recognition, or OCR, pushing the boundaries of what has been seen as an otherwise ‘solved’ problem. In our conversation with Cha, we explore some of the traditional challenges of doing OCR in the wild, and what are the ways in which deep learning algorithms are being applied to transform these solutions. 

We also discuss the difficulties of using an end to end pipeline for OCR work, if there is a semi-supervised framing that could be used for OCR, the role of techniques like neural architecture search, how advances in NLP could influence the advancement of OCR problems, and much more. 

The complete show notes for this episode can be found at twimlai.com/go/416.

Oct 05, 2020
Machine Learning for Food Delivery at Global Scale - #415
56:40
In this special edition of the show, we discuss the various ways in which machine learning plays a role in helping businesses overcome their challenges in the food delivery space. 
A few weeks ago Sam had the opportunity to moderate a panel at the Prosus AI Marketplace virtual event with Sandor Caetano of iFood, Dale Vaz of Swiggy, Nicolas Guenon of Delivery Hero, and Euro Beinat of Prosus. 
In this conversation, panelists describe the application of machine learning to a variety of business use cases, including how they deliver recommendations, the unique ways they handle the logistics of deliveries, and fraud and abuse prevention. 

The complete show notes for this episode can be found at twimlai.com/go/415.

Oct 02, 2020
Open Source at Qualcomm AI Research with Jeff Gehlhaar and Zahra Koochak - #414
44:06

Today we're joined by Jeff Gehlhaar, VP of Technology at Qualcomm, and Zahra Koochak, Staff Machine Learning Engineer at Qualcomm AI Research. 

If you haven’t had a chance to listen to our first interview with Jeff, I encourage you to check it out here! In this conversation, we catch up with Jeff and Zahra to get an update on what the company has up to since our last conversation, including the Snapdragon 865 chipset and Hexagon Neural Network Direct. 

We also discuss open-source projects like the AI efficiency toolkit and Tensor Virtual Machine compiler, and how these projects fit in the broader Qualcomm ecosystem. Finally, we talk through their vision for on-device federated learning. 

The complete show notes for this page can be found at twimlai.com/go/414.

Sep 30, 2020
Visualizing Climate Impact with GANs w/ Sasha Luccioni - #413
42:02

Today we’re joined by Sasha Luccioni, a Postdoctoral Researcher at the MILA Institute, and moderator of our upcoming TWIMLfest Panel, ‘Machine Learning in the Fight Against Climate Change.’ 

We were first introduced to Sasha’s work through her paper on ‘Visualizing The Consequences Of Climate Change Using Cycle-consistent Adversarial Networks’, and we’re excited to pick her brain about the ways ML is currently being leveraged to help the environment. In our conversation, we explore the use of GANs to visualize the consequences of climate change, the evolution of different approaches she used, and the challenges of training GANs using an end-to-end pipeline.

Finally, we talk through Sasha’s goals for the aforementioned panel, which is scheduled for Friday, October 23rd at 1 pm PT. Register for all of the great TWIMLfest sessions at twimlfest.com!

The complete show notes for this episode can be found at twimlai.com/go/413.

Sep 28, 2020
ML-Powered Language Learning at Duolingo with Burr Settles - #412
54:51

Today we’re joined by Burr Settles, Research Director at Duolingo. Most would acknowledge that one of the most effective ways to learn is one on one with a tutor, and Duolingo’s main goal is to replicate that at scale.

In our conversation with Burr, we dig how the business model has changed over time, the properties that make a good tutor, and how those features translate to the AI tutor they’ve built. We also discuss the Duolingo English Test, and the challenges they’ve faced with maintaining the platform while adding languages and courses.

Check out the complete show notes for this episode at twimlai.com/go/412.

Sep 24, 2020
Bridging The Gap Between Machine Learning and the Life Sciences with Artur Yakimovich - #411
39:53

Today we’re joined by Artur Yakimovich, Co-Founder at Artificial Intelligence for Life Sciences and a visiting scientist in the Lab for Molecular Cell Biology at University College London. In our conversation with Artur, we explore the gulf that exists between life science researchers and the tools and applications used by computer scientists. 

While Artur’s background is in viral chemistry, he has since transitioned to a career in computational biology to “see where chemistry stopped, and biology started.” We discuss his work in that middle ground, looking at quite a few of his recent work applying deep learning and advanced neural networks like capsule networks to his research problems. 

Finally, we discuss his efforts building the Artificial Intelligence for Life Sciences community, a non-profit organization he founded to bring scientists from different fields together to share ideas and solve interdisciplinary problems. 

Check out the complete show notes at twimlai.com/go/411.

Sep 21, 2020
Understanding Cultural Style Trends with Computer Vision w/ Kavita Bala - #410
38:48

Today we’re joined by Kavita Bala, the Dean of Computing and Information Science at Cornell University. 

Kavita, whose research explores the overlap of computer vision and computer graphics, joined us to discuss a few of her projects, including GrokStyle, a startup that was recently acquired by Facebook and is currently being deployed across their Marketplace features. We also talk about StreetStyle/GeoStyle, projects focused on using social media data to find style clusters across the globe. 

Kavita shares her thoughts on the privacy and security implications, progress with integrating privacy-preserving techniques into vision projects like the ones she works on, and what’s next for Kavita’s research.

The complete show notes for this episode can be found at twimlai.com/go/410.

Sep 17, 2020
That's a VIBE: ML for Human Pose and Shape Estimation with Nikos Athanasiou, Muhammed Kocabas, Michael Black - #409
44:51

Today we’re joined by Nikos Athanasiou, Muhammed Kocabas, Ph.D. students, and Michael Black, Director of the Max Planck Institute for Intelligent Systems. 

We caught up with the group to explore their paper VIBE: Video Inference for Human Body Pose and Shape Estimation, which they submitted to CVPR 2020. In our conversation, we explore the problem that they’re trying to solve through an adversarial learning framework, the datasets (AMASS) that they’re building upon, the core elements that separate this work from its predecessors in this area of research, and the results they’ve seen through their experiments and testing.

 The complete show notes for this episode can be found at https://twimlai.com/go/409.

Register for TWIMLfest today!

Sep 14, 2020
3D Deep Learning with PyTorch 3D w/ Georgia Gkioxari - #408
36:46

Today we’re joined by Georgia Gkioxari, a research scientist at Facebook AI Research. 

Georgia was hand-picked by the TWIML community to discuss her work on the recently released open-source library PyTorch3D. In our conversation, Georgia describes her experiences as a computer vision researcher prior to the 2012 deep learning explosion, and how the entire landscape has changed since then. 

Georgia walks us through the user experience of PyTorch3D, while also detailing who the target audience is, why the library is useful, and how it fits in the broad goal of giving computers better means of perception. Finally, Georgia gives us a look at what it’s like to be a co-chair for CVPR 2021 and the challenges with updating the peer review process for the larger academic conferences. 

The complete show notes for this episode can be found at twimlai.com/go/408.

Sep 10, 2020
What are the Implications of Algorithmic Thinking? with Michael I. Jordan - #407
57:27

Today we’re joined by the legendary Michael I. Jordan, Distinguished Professor in the Departments of EECS and Statistics at UC Berkeley. 

Michael was gracious enough to connect us all the way from Italy after being named IEEE’s 2020 John von Neumann Medal recipient. In our conversation with Michael, we explore his career path, and how his influence from other fields like philosophy shaped his path. 

We spend quite a bit of time discussing his current exploration into the intersection of economics and AI, and how machine learning systems could be used to create value and empowerment across many industries through “markets.” We also touch on the potential of “interacting learning systems” at scale, the valuation of data, the commoditization of human knowledge into computational systems, and much, much more.

The complete show notes for this episode can be found at. twimlai.com/go/407.

Sep 07, 2020
Beyond Accuracy: Behavioral Testing of NLP Models with Sameer Singh - #406
41:11

Today we’re joined by Sameer Singh, an assistant professor in the department of computer science at UC Irvine. 

Sameer’s work centers on large-scale and interpretable machine learning applied to information extraction and natural language processing. We caught up with Sameer right after he was awarded the best paper award at ACL 2020 for his work on Beyond Accuracy: Behavioral Testing of NLP Models with CheckList.

In our conversation, we explore CheckLists, the task-agnostic methodology for testing NLP models introduced in the paper. We also discuss how well we understand the cause of pitfalls or failure modes in deep learning models, Sameer’s thoughts on embodied AI, and his work on the now famous LIME paper, which he co-authored alongside Carlos Guestrin. 

The complete show notes for this episode can be found at twimlai.com/go/406.

Sep 03, 2020
How Machine Learning Powers On-Demand Logistics at Doordash with Gary Ren - #405
43:48

Today we’re joined by Gary Ren, a machine learning engineer for the logistics team at DoorDash. 

In our conversation, we explore how machine learning powers the entire logistics ecosystem. We discuss the stages of their “marketplace,” and how using ML for optimized route planning and matching affects consumers, dashers, and merchants. We also talk through how they use traditional mathematics, classical machine learning, potential use cases for reinforcement learning frameworks, and challenges to implementing these explorations.  

The complete show notes for this episode can be found at twimlai.com/go/405!

Check out our upcoming event at twimlai.com/twimlfest

Aug 31, 2020
Machine Learning as a Software Engineering Discipline with Dillon Erb - #404
44:39

Today we’re joined by Dillon Erb, Co-founder & CEO of Paperspace.

We’ve followed Paperspace since their origins offering GPU-enabled compute resources to data scientists and machine learning developers, to the release of their Jupyter-based Gradient service. Our conversation with Dillon centered on the challenges that organizations face building and scaling repeatable machine learning workflows, and how they’ve done this in their own platform by applying time-tested software engineering practices. 

We also discuss the importance of reproducibility in production machine learning pipelines, how the processes and tools of software engineering map to the machine learning workflow, and technical issues that ML teams run into when trying to scale the ML workflow.

The complete show notes for this episode can be found at twimlai.com/go/404.

Aug 27, 2020
AI and the Responsible Data Economy with Dawn Song - #403
52:17

Today we’re joined by Professor of Computer Science at UC Berkeley, Dawn Song. Dawn’s research is centered at the intersection of AI, deep learning, security, and privacy. She’s currently focused on bringing these disciplines together with her startup, Oasis Labs. 

In our conversation, we explore their goals of building a ‘platform for a responsible data economy,’ which would combine techniques like differential privacy, blockchain, and homomorphic encryption. The platform would give consumers more control of their data, and enable businesses to better utilize data in a privacy-preserving and responsible way. 

We also discuss how to privatize and anonymize data in language models like GPT-3, real-world examples of adversarial attacks and how to train against them, her work on program synthesis to get towards AGI, and her work on privatizing coronavirus contact tracing data.

The complete show notes for this episode can be found twimlai.com/go/403.

Aug 24, 2020
Relational, Object-Centric Agents for Completing Simulated Household Tasks with Wilka Carvalho - #402
41:04

Today we’re joined by Wilka Carvalho, a PhD student at the University of Michigan, Ann Arbor.

We first met Wilka at the Black in AI workshop at last year’s NeurIPS conference, and finally got a chance to catch up about his latest research, ‘ROMA: A Relational, Object-Model Learning Agent for Sample-Efficient Reinforcement Learning.’ In the paper, Wilka explores the challenge of object interaction tasks, focusing on every day, in-home functions like filling a cup of water in a sink. 

In our conversation, we discuss his interest in understanding the foundational building blocks of intelligence, how he’s addressing the challenge of ‘object-interaction’ tasks, the biggest obstacles he’s run into along the way.

The complete show notes for this episode can be found at twimlai.com/go/402.

Aug 20, 2020
Model Explainability Forum - #401
01:26:41

Today we’re bringing you the latest TWIML Discussion Series panel on Model Explainability. The use of machine learning in business, government, and other settings that require users to understand the model’s predictions has exploded in recent years. This growth, combined with the increased popularity of opaque ML models like deep learning, has led to the development of a thriving field of model explainability research and practice. 

In this panel discussion, we bring together experts and researchers to explore the current state of explainability and some of the key emerging ideas shaping the field. Each guest will share their unique perspective and contributions to thinking about model explainability in a practical way.

We explore concepts like stakeholder-driven explainability, adversarial attacks on explainability methods, counterfactual explanations, legal and policy implications, and more. We round out the session with an audience Q&A! Check out the list of resources below!

The complete show notes for this episode can be found at twimlai.com/go/401.

Aug 17, 2020
What NLP Tells Us About COVID-19 and Mental Health with Johannes Eichstaedt - #400
58:09

Today we’re joined by Johannes Eichstaedt, an Assistant Professor of Psychology at Stanford University. 

Johannes joined us at the outset of the coronavirus pandemic to discuss his use of Facebook and Twitter data to measure the psychological states of large populations and individuals. In our conversation, we explore how Johannes applies his physics background to a career as a computational social scientist, the differences in communication on social media vs the real world, and what language indicators point to changes in mental health. 

We also discuss some of the major patterns in the data that emerged over the first few months of lockdown, including mental health, social norms, and political patterns. We also explore how Johannes built the process, and the techniques he’s using to collect, sift through, and understand the data.

The complete show notes for this episode can be found at twimlai.com/go/400.

Aug 13, 2020
Human-AI Collaboration for Creativity with Devi Parikh - #399
44:56

Today we’re joined by Devi Parikh, Associate Professor at the School of Interactive Computing at Georgia Tech, and research scientist at Facebook AI Research (FAIR). 

While Devi’s work is more broadly focused on computer vision applications, we caught up to discuss her presentation on AI and Creativity at the CV for Fashion, Art and Design workshop at CVPR 2020. In our conversation, we touch on Devi’s definition of creativity,  explore multiple ways that AI could impact the creative process for artists, and help humans become more creative. We investigate tools like casual creator for preference prediction, neuro-symbolic generative art, and visual journaling. 

The complete show notes for this episode can be found at twimlai.com/talk/399.

A quick reminder that this is your last chance to register for tomorrow’s Model Explainability Forum! For more information, visit https://twimlai.com/explainabilityforum.

Aug 10, 2020
Neural Augmentation for Wireless Communication with Max Welling - #398
49:15

Today we’re joined by Max Welling, Vice President of Technologies at Qualcomm Netherlands, and Professor at the University of Amsterdam. In case you missed it, Max joined us last year to discuss his work on  Gauge Equivariant CNNs and Generative Models - the 2nd most popular episode of 2019. 

In this conversation, we explore the concept and Max’s work in neural augmentation, and how it’s being deployed for channel tracking and other applications. We also discuss their current work on federated learning and incorporating the technology on devices to give users more control over the privacy of their personal data. Max also shares his thoughts on quantum mechanics and the future of quantum neural networks for chip design.

The complete show notes for this episode can be found at twimlai.com/talk/398.

This episode is sponsored by Qualcomm Technologies.

Aug 06, 2020
Quantum Machine Learning: The Next Frontier? with Iordanis Kerenidis - #397
01:01:36

Today we conclude our 2020 ICML coverage joined by Iordanis Kerenidis, Research Director at Centre National de la Recherche Scientifique (CNRS) in Paris, and Head of Quantum Algorithms at QC Ware.

Iordanis’ research centers around quantum algorithms of machine learning, and was an ICML main conference Keynote speaker on the topic! We focus our conversation on his presentation, exploring the prospects and challenges of quantum machine learning, as well as the field’s history, evolution, and future. We’ll also discuss the foundations of quantum computing, and some of the challenges to consider for breaking into the field.

The complete show notes for this episode can be found at twimlai.com/talk/397. For complete ICML series details, visit twimlai.com/icml20.

Aug 04, 2020
ML and Epidemiology with Elaine Nsoesie - #396
48:31

Today we continue our ICML series with Elaine Nsoesie, assistant professor at Boston University. 

Elaine presented a keynote talk at the ML for Global Health workshop at ICML 2020, where she shared her research centered around data-driven epidemiology. In our conversation, we discuss the different ways that machine learning applications can be used to address global health issues, including use cases like infectious disease surveillance via hospital parking lot capacity, and tracking search data for changes in health behavior in African countries. We also discuss COVID-19 epidemiology, focusing on the importance of recognizing how the disease is affecting people of different races, ethnicities, and economic backgrounds.

To follow along with our 2020 ICML Series, visit twimlai.com/icml20. The complete show notes for this episode can be found at twimali.com/talk/396.

Jul 30, 2020
Language (Technology) Is Power: Exploring the Inherent Complexity of NLP Systems with Hal Daumé III - #395
01:04:43

Today we’re joined by Hal Daume III, professor at the University of Maryland, Senior Principal Researcher at Microsoft Research, and Co-Chair of the 2020 ICML Conference. 

We had the pleasure of catching up with Hal ahead of this year's ICML to discuss his research at the intersection of bias, fairness, NLP, and the effects language has on machine learning models. 

We explore language in two categories as they appear in machine learning models and systems: (1) How we use language to interact with the world, and (2) how we “do” language. We also discuss ways to better incorporate domain experts into ML system development, and Hal’s experience as ICML Co-Chair.

Follow along with our ICML coverage at twimlai.com/icml20. The complete show notes for this episode can be found at twimlai.com/talk/395.

Jul 27, 2020
Graph ML Research at Twitter with Michael Bronstein - #394
56:37

Today we’re excited to be joined by return guest Michael Bronstein, Professor at Imperial College London, and Head of Graph Machine Learning at Twitter. We last spoke with Michael at NeurIPS in 2017 about Geometric Deep Learning

Since then, his research focus has slightly shifted to exploring graph neural networks. In our conversation, we discuss the evolution of the graph machine learning space, contextualizing Michael’s work on geometric deep learning and research on non-euclidian unstructured data. We also talk about his new role at Twitter and some of the research challenges he’s faced, including scalability and working with dynamic graphs. Michael also dives into his work on differential graph modules for graph CNNs, and the various applications of this work.

The complete show notes for this episode can be found at twimlai.com/talk/394.

Jul 23, 2020
Panel: The Great ML Language (Un)Debate! - #393
01:33:08

Today we’re excited to bring The Great ML Language (Un)Debate’ to the podcast! In the latest edition of our series of live discussions, we brought together experts and enthusiasts representing an array of both popular and emerging programming languages for machine learning. In the discussion, we explored the strengths, weaknesses, and approaches offered by Clojure, JavaScript, Julia, Probabilistic Programming, Python, R, Scala, and Swift. We round out the session with an audience Q&A (58:28), covering topics including favorite secondary languages, what languages pair well, quite a few questions about C++, and much more. 

Head over to twimlai.com/talk/393 for more information about our panelists!

Jul 20, 2020
What the Data Tells Us About COVID-19 with Eric Topol - #392
41:36

Today we’re joined by Eric Topol, Director & Founder of the Scripps Research Translational Institute, and author of the book Deep Medicine. 

Eric is also one of the most trusted voices on the COVID-19 pandemic, giving those that follow his Twitter account (link) daily updates on the disease and its impact, from both a biological and public health perspective. We had the pleasure of catching up with Eric to talk through several Coronavirus-related topics, including what we’ve learned since the pandemic began and the role of technology—including ML and AI—in understanding and preventing the spread of the disease. We also explore the broader opportunity for medical applications of AI, the promise they offer for personalized medicine, and how techniques like federated learning and homomorphic encryption can offer more privacy in healthcare.  

The complete show notes for this episode can be found at twimlai.com/talk/392.

Jul 16, 2020
The Case for Hardware-ML Model Co-design with Diana Marculescu - #391
44:58

Today we’re joined by Diana Marculescu, Department Chair and Professor of Electrical and Computer Engineering at University of Texas at Austin. 

We caught up with Diana to discuss her work on hardware-aware machine learning. In particular, we explore her keynote, “Putting the “Machine” Back in Machine Learning: The Case for Hardware-ML Model Co-design” from the Efficient Deep Learning in Computer Vision workshop at this year’s CVPR conference. 

In our conversation, we explore how her research group is focusing on making ML models more efficient so that they run better on current hardware systems, and what components and techniques they’re using to achieve true co-design. We also discuss her work with Neural architecture search, how this fits into the edge vs cloud conversation, and her thoughts on the longevity of deep learning research. 

The complete show notes for this episode can be found at twimlai.com/talk/391.

Jul 13, 2020
Computer Vision for Remote AR with Flora Tasse - #390
40:54

Today we conclude our CVPR coverage joined by Flora Tasse, Head of Computer Vision & AI Research at Streem. 

Flora, a keynote speaker at the AR/VR workshop at CVPR, walks us through some of the interesting use cases at the intersection of AI, computer vision, and augmented reality technology. In our conversation, we discuss how Flora’s interest in a career in AR/VR developed, the origin of her company Selerio, which was eventually acquired by Streem, and her current research.

We also spend time exploring the difficulties associated with building 3D mesh environments, extracting metadata from those environments, the challenges of pose estimation, and other papers that caught Flora’s eye from the conference.

The complete show notes for this episode can be found at twimlai.com/talk/390. For our complete CVPR series, head to twimlai.com/cvpr20.

Jul 09, 2020
Deep Learning for Automatic Basketball Video Production with Julian Quiroga - #389
42:15

Today we return to our coverage of the 2020 CVPR conference with a conversation with Julian Quiroga, a Computer Vision Team Lead at Genius Sports.

Julian presented his recent paper “As Seen on TV: Automatic Basketball Video Production using Gaussian-based Actionness and Game States Recognition” at the CVSports workshop. We jump right into the paper, discussing details like camera setups and angles, detection and localization of the figures on the court (players, refs, and of course, the ball), and the role that deep learning plays in the process. We also break down how this work applies to different sports, and the ways that Julian is looking to improve on this work for better accuracy. 

The complete show notes for this episode can be found at twimlai.com/talk/389. To follow along with our entire CVPR series, visit twimlai.com/cvpr20.

Thanks again to our friends at Qualcomm for their support of the podcast and sponsorship of this series!

Jul 06, 2020
How External Auditing is Changing the Facial Recognition Landscape with Deb Raji - #388
01:21:47

Today we’re taking a break from our CVPR coverage to bring you this interview with Deb Raji, a Technology Fellow at the AI Now Institute at New York University. 

Over the past week or two, there have been quite a few major news stories in the AI community, including the self-imposed moratorium on facial recognition technology from Amazon, IBM and Microsoft.There was also the release of PULSE, a controversial computer vision model that ultimately sparked a Twitter firestorm involving Yann Lecun and AI ethics researchers, including friend of the show, Timnit Gebru. The controversy echoed into the broader AI community, eventually leading to the former’s departure from Twitter. 

In our conversation with Deb, we dig into these stories in depth, discussing the origins of Deb’s work on the Gender Shades project, how subsequent work put a spotlight on the potential harms of facial recognition technology, and who holds responsibility for dealing with underlying bias issues in datasets.

The complete show notes for this episode can be found at twimlai.com/talk/388.

Jul 02, 2020
AI for High-Stakes Decision Making with Hima Lakkaraju - #387
45:54

Today we’re joined by Hima Lakkaraju, an Assistant Professor at Harvard University with appointments in both the Business School and Department of Computer Science. 

At CVPR, Hima was a keynote speaker at the Fair, Data-Efficient and Trusted Computer Vision Workshop, where she spoke on Understanding the Perils of Black Box Explanations. Hima talks us through her presentation, which focuses on the unreliability of explainability techniques that center perturbations, such as LIME or SHAP, as well as how attacks on these models can be carried out, and what these attacks look like. We also discuss people’s tendency to trust computer systems and their outputs, her thoughts on collaborator (and former TWIML guest) Cynthia Rudin’s theory that we shouldn’t use black-box algorithms, and much more.

For the complete show notes, visit twimlai.com/talk/387. For our continuing CVPR Coverage, visit twimlai.com/cvpr20.

Jun 29, 2020
Invariance, Geometry and Deep Neural Networks with Pavan Turaga - #386
47:14

We continue our CVPR coverage with today’s guest, Pavan Turaga, Associate Professor at Arizona State University, with dual appointments as the Director of the Geometric Media Lab, and Interim Director of the School of Arts, Media, and Engineering.

Pavan gave a keynote presentation at the Differential Geometry in CV and ML Workshop, speaking on Revisiting Invariants with Geometry and Deep Learning. In our conversation, we go in-depth on Pavan’s research integrating physics-based principles into computer vision. We also discuss the context of the term “invariant,” and the role of architectural, loss function, and data constraints on models. Pavan also contextualizes this work in relation to Hinton’s similar Capsule Network research.

Check out the complete show notes for this episode at twimlai.com/talk/386.

Jun 25, 2020
Channel Gating for Cheaper and More Accurate Neural Nets with Babak Ehteshami Bejnordi - #385
55:58

Today we’re joined by Babak Ehteshami Bejnordi, a Research Scientist at Qualcomm.

Babak works closely with former guest Max Welling and is currently focused on conditional computation, which is the main driver for today’s conversation. We dig into a few papers in great detail including one from this year’s CVPR conference, Conditional Channel Gated Networks for Task-Aware Continual Learning

We also discuss the paper TimeGate: Conditional Gating of Segments in Long-range Activities, and another paper from this year’s ICLR conference, Batch-Shaping for Learning Conditional Channel Gated Networks. We cover how gates are used to drive efficiency and accuracy, while decreasing model size, how this research manifests into actual products, and more! 

For more information on the episode, visit twimlai.com/talk/385. To follow along with the CVPR 2020 Series, visit twimlai.com/cvpr20

Thanks to Qualcomm for sponsoring today’s episode and the CVPR 2020 Series!

Jun 22, 2020
Machine Learning Commerce at Square with Marsal Gavalda - #384
51:53

Today we’re joined by Marsal Gavalda, head of machine learning for the Commerce platform at Square. 

Marsal, who hails from Barcelona, Catalonia, kicks off our conversation by indulging Sam in their shared love for language, which is what put him on the path to a career in machine learning. At Square, Marsal manages the development of machine learning for various tools and platforms, including marketing, appointments, and above all, risk management. 

We explore how they manage this vast portfolio of projects, and how having an ML and technology focus at the outset of the company has contributed to their success. We also discuss some of Marsal’s tips and best practices for internal democratization of ML, their approach to developing ML-driven features, the techniques deployed in the development of those features, and much more!

The complete show notes for this episode can be found at twimlai.com/talk/384.

Jun 18, 2020
Cell Exploration with ML at the Allen Institute w/ Jianxu Chen - #383
43:21

Today we’re joined by Jianxu Chen, a scientist in the Assay Development group at the Allen Institute for Cell Science. 

At the latest GTC conference, Jianxu presented his work on the Allen Cell Explorer Toolkit, an open-source project that allows users to do 3D segmentation of intracellular structures in fluorescence microscope images at high resolutions, making the images more accessible for data analysis. 

In our conversation, we discuss three of the major components of the toolkit: the cell image analyzer, the image generator, and the image visualizer. We also explore Jianxu’s transition from computer science into computational biology. More broadly, we cover how the use of GPUs has fundamentally changed this research, and the goals his team had in mind when they began the project.

Check out the complete show notes at twimlai.com/talk/383.

Jun 15, 2020
Neural Arithmetic Units & Experiences as an Independent ML Researcher with Andreas Madsen - #382
30:54

Today we’re joined by Andreas Madsen, an independent researcher based in Denmark whose research focuses on developing interpretable machine learning models. 

While we caught up with Andreas to discuss his ICLR spotlight paper, “Neural Arithmetic Units,” we also spend time exploring his experience as an independent researcher. We discuss the difficulties of working with limited resources, the importance of finding peers to collaborate with, and tempering expectations of getting papers accepted to conferences -- something that might take a few tries to get right.

In his paper, Andreas notes that Neural Networks struggle to perform exact arithmetic operations over real numbers, but this can be helped with the addition of two NN components: the Neural Addition Unit (NAU), which can learn exact addition and subtraction; and the Neural Multiplication Unit (NMU) that can multiply subsets of a vector.

The complete show notes can be found at twimlai.com/talk/382.

Jun 11, 2020
2020: A Critical Inflection Point for Responsible AI with Rumman Chowdhury - #381
01:01:58

Today we’re joined by Rumman Chowdhury, Managing Director and Global Lead of Responsible Artificial Intelligence at Accenture. In our conversation with Rumman, we explored questions like: 

  • Why is now such a critical inflection point in the application of responsible AI?
  • How should engineers and practitioners think about AI ethics and responsible AI?
  • Why is AI ethics inherently personal and how can you define your own personal approach?
  • Is the implementation of AI governance necessarily authoritarian?
  • How do we balance idealism and pragmatism in the application of AI ethics?

We also cover practical topics like how and where you should implement responsible AI in your organization, and building the teams and processes capable of taking on critical ethics and governance questions.

The complete show notes for this episode can be found at twimlai.com/talk/381.

Jun 08, 2020
Panel: Advancing Your Data Science Career During the Pandemic - #380
01:07:16

Today we’re joined by Ana Maria Echeverri, Caroline Chavier, Hilary Mason, and Jacqueline Nolis, our guests for the recent Advancing Your Data Science Career During the Pandemic panel.

In this conversation, we explore ways that Data Scientists and ML/AI practitioners can continue to advance their careers despite current challenges. Our panelists provide concrete tips, advice, and direction for those just starting out, those affected by layoffs, and those just wanting to move forward in their careers.

Topics we cover include:

  • Guerilla Job Hunting
  • Portfolio Building
  • Navigating Hiring Freezes
  • Acing the Technical Interview
  • Presenting the Best Candidate

For more information about our guests, or for links to the resources mentioned, visit the show notes page at twimlai.com/talk/380.

Jun 04, 2020
On George Floyd, Empathy, and the Road Ahead
06:20

Visit twimlai.com/blacklivesmatter for resources to support organizations pushing for social equity like Black Lives Matter, and groups offering relief for those jailed for exercising their rights to peaceful protest. 

Jun 02, 2020
Engineering a Less Artificial Intelligence with Andreas Tolias - #379
46:41

Today we’re joined by Andreas Tolias, Professor of Neuroscience at Baylor College of Medicine and Principal Investigator of the Neuroscience-Inspired Networks for Artificial Intelligence organization.

We caught up with Andreas to discuss his recent perspective piece, “Engineering a Less Artificial Intelligence,” which explores the shortcomings of state-of-the-art learning algorithms in comparison to the brain. The paper also offers several ideas about how neuroscience can lead the quest for better inductive biases by providing useful constraints on representations and network architecture. We discuss the promise of deep neural networks, the differences between inductive bias and model bias, the role of interpretability, and the exciting future of biological systems and deep learning. 

The complete show notes can be found at twimali.com/talk/379.

May 28, 2020
Rethinking Model Size: Train Large, Then Compress with Joseph Gonzalez - #378
52:39

Today we’re joined by Joseph Gonzalez, Assistant Professor in the EECS department at UC Berkeley. 

Our main focus in the conversation is Joseph’s paper “Train Large, Then Compress: Rethinking Model Size for Efficient Training and Inference of Transformers,” which explores compute-efficient training strategies, based on model size.

We discuss the two main problems being solved; 1) How can we rapidly iterate on variations in architecture? And 2) If we make models bigger, is it really improving any efficiency? We also discuss the parallels between computer vision and NLP tasks, how he characterizes both “larger” and “faster” in the paper.

Check out the complete show notes for this episode at twimlai.com/talk/378.

May 25, 2020
The Physics of Data with Alpha Lee - #377
34:29

Today we’re joined by Alpha Lee, Winton Advanced Fellow in the Department of Physics at the University of Cambridge, and Co-Founder of data-driven drug discovery startup, PostEra. Our conversation centers around Alpha’s research which can be broken down into three main categories: data-driven drug discovery, material discovery, and physical analysis of machine learning. 

We discuss the similarities and differences between drug discovery and material science, including the parallels in the design test cycle, and the major differences in cost. We also explore the goals associated with uncertainty estimation, why deep networks are easier to optimize than shallow networks, the concept of energy landscape, and how it all fits into his research. We also talk about his startup, PostEra which offers medicinal chemistry as a service powered by machine learning.

The complete show notes for this episode can be found at twimlai.com/talk/377.

May 21, 2020
Is Linguistics Missing from NLP Research? w/ Emily M. Bender - #376 🦜
52:34

Today we’re joined by Emily M. Bender, Professor of Linguistics at the University of Washington. 

Our discussion covers a lot of ground, but centers on the question, "Is Linguistics Missing from NLP Research?" We explore if we would be making more progress, on more solid foundations, if more linguists were involved in NLP research, or is the progress we're making (e.g. with deep learning models like Transformers) just fine?

Later this afternoon (3pm PT) we’ll be hosting a viewing party with Emily over on our YouTube channel. Sam and Emily will be in the live chat answering your questions from the conversation. Register at twimlai.com/376viewing!

Check out the complete show notes for this conversation at twimlai.com/talk/376.

May 18, 2020
Disrupting DeepFakes: Adversarial Attacks Against Conditional Image Translation Networks with Nataniel Ruiz - #375
42:42

Today we’re joined by Nataniel Ruiz, a PhD Student in the Image & Video Computing group at Boston University. 

We caught up with Nataniel to discuss his paper “Disrupting DeepFakes: Adversarial Attacks Against Conditional Image Translation Networks and Facial Manipulation Systems,” which will be presented at the upcoming CVPR conference. In our conversation, we discuss the concept of this work, which essentially injects noise into an image to disrupt a generative model’s ability to manipulate said image. We also explore some of the challenging parts of implementing this work, a few potential scenarios in which this could be deployed, and the broader contributions that went into this work. 

The complete show notes for this episode can be found at twimlai.com/talk/375.

May 14, 2020
Understanding the COVID-19 Data Quality Problem with Sherri Rose - #374
44:30

Today we’re joined by Sherri Rose, Associate Professor at Harvard Medical School. 

Sherri’s research centers around developing and integrating statistical machine learning approaches to improve human health. We cover a lot of ground in our conversation, including the intersection of her research with the current COVID-19 pandemic, the importance of quality in datasets and rigor when publishing papers, and the pitfalls of using causal inference.

We also touch on Sherri’s work in algorithmic fairness, including the necessary emphasis being put on studying issues of fairness, the shift she’s seen in fairness conferences covering these issues in relation to healthcare research, and her paper “Fair Regression for Health Care Spending.”

Check out the complete show notes for this episode at twimlai.com/talk/374.

May 11, 2020
The Whys and Hows of Managing Machine Learning Artifacts with Lukas Biewald - #373
53:30

Today we’re joined by Lukas Biewald, founder and CEO of Weights & Biases, to discuss their new tool Artifacts, an end to end pipeline tracker. You might remember Lukas from his original interview with us towards the end of last year, for more background on Lukas and W&B we encourage you to check that out here .

In this conversation, we explore Artifacts’ place in the broader machine learning tooling ecosystem through the lens of our eBook “The definitive guide to ML Platforms” and how it fits with the W&B model management platform. We discuss also discuss what exactly “Artifacts” are, what the tool is tracking, and take a look at the onboarding process for users. 

Check out the complete show notes for this episode at twimlai.com/talk/373.

May 07, 2020
Language Modeling and Protein Generation at Salesforce with Richard Socher - #372
42:36

Today we’re joined Richard Socher, Chief Scientist and Executive VP at Salesforce.

Richard, who has been at the forefront of Salesforce’s AI Research since they acquired his startup Metamind in 2016, and his team have been publishing a ton of great projects as of late, including CTRL: A Conditional Transformer Language Model for Controllable Generation, and ProGen, an AI Protein Generator, both of which we cover in-depth in this conversation. We explore the balancing act between investments, product requirement research and otherwise at a large product-focused company like Salesforce, the evolution of his language modeling research since being acquired, and how it ties in with Protein Generation.

The complete show notes for this episode can be found at twimlai.com/talk/372.  

May 04, 2020
AI Research at JPMorgan Chase with Manuela Veloso - #371
45:25

Today we’re joined by Manuela Veloso, Head of AI Research at JPMorgan Chase and Professor at Carnegie Mellon University. Since moving from CMU to JPMorgan Chase, Manuela and her team established a set of seven lofty research goals. In this conversation we focus on the first three: building AI systems to eradicate financial crime, safely liberate data, and perfect client experience. 

We also explore Manuela’s background, including her time as a PhD student at CMU, or as she describes it, the “mecca of AI,” with some of the most influential figures in AI like Geoff Hinton, and Herb Simon on the faculty at the time. We also cover Manuela’s founding role with RoboCup, an annual international competition centered on autonomous robots playing soccer.

The complete show notes for this episode can be found at twimlai.com/talk/371.

Apr 30, 2020
Panel: Responsible Data Science in the Fight Against COVID-19 - #370
57:03

Since the beginning of the coronavirus pandemic, we’ve seen an outpouring of interest on the part of data scientists and AI practitioners wanting to make a contribution. At the same time, some of the resulting efforts have been criticized for promoting the spread of misinformation or being disconnected from the applicable domain knowledge.

In this discussion, we explore how data scientists and ML/AI practitioners can responsibly contribute to the fight against coronavirus and COVID-19. Four experts: Rex Douglass, Rob Munro, Lea Shanley, and Gigi Yuen-Reed shared a ton of valuable insight on the best ways to get involved.

We've gathered all the resources that our panelists discussed during the conversation, you can find those at twimlai.com/talk/370.

Apr 29, 2020
Adversarial Examples Are Not Bugs, They Are Features with Aleksander Madry - #369
41:03

Today we’re joined by Aleksander Madry, Faculty in the MIT EECS Department, a member of CSAIL and of the Theory of Computation group. Aleksander, whose work is more on the theoretical side of machine learning research, walks us through his paper “Adversarial Examples Are Not Bugs, They Are Features,” which was published previously presented at last year’s NeurIPS conference. 

In our conversation, we explore the idea of adversarial examples in machine learning systems being features, with results that might be undesirable, but still working as designed. We talk through what we expect these systems to do, vs what they’re actually doing, if we’re able to characterize these patterns, and what makes them compelling, and if the insights from the paper will inform opinions on either side of the deep learning debate.

The complete show notes for this can be found at twimlai.com/talk/369.

Apr 27, 2020
AI for Social Good: Why "Good" isn't Enough with Ben Green - #368
40:34

Today we’re joined by Ben Green, PhD Candidate at Harvard, Affiliate at the Berkman Klein Center for Internet & Society at Harvard, Research Fellow at the AI Now Institute at NYU. 

Ben’s research is focused on social and policy impacts of data science, with a focus on algorithmic fairness, municipal governments, and the criminal justice system. In our conversation, we discuss his paper ‘Good' Isn't Good Enough,’ which explores the 2 things he feels are missing from data science and machine learning projects, papers and research; A grounded definition of what “good” actually means, and the absence of a “theory of change.” We also talk through how he thinks about the unintended consequence associated with the application of technology to social good, and his theory for the relationship between technology and social impact. 

The complete show notes for this episode can be found at twimlai.com/talk/368.

Apr 23, 2020
The Evolution of Evolutionary AI with Risto Miikkulainen - #367
38:13

Today we’re joined by Risto Miikkulainen, Associate VP of Evolutionary AI at Cognizant AI, and Professor of Computer Science at the UT Austin.

Risto joined us back on episode #47 to discuss evolutionary algorithms, and today we do an update of sorts on what is the latest we should know on the topic. In our conversation, we discuss various use cases for evolutionary AI, the relationship between evolutionary algorithms and reinforcement learning, some of the latest approaches to deploying evolutionary models. We also explore his paper “Better Future through AI: Avoiding Pitfalls and Guiding AI Towards its Full Potential,” which details the historical evolution of AI, discussing where things currently stand, and where they might go in the future. 

The complete show notes for this episode can be found at twimlai.com/talk/367.

Apr 20, 2020
Neural Architecture Search and Google’s New AutoML Zero with Quoc Le - #366
53:43

Today we’re super excited to share our recent conversation with Quoc Le, a research scientist at Google, on the Brain team. Quoc has been very busy recently with his work on Google’s AutoML Zero, which details significant advances in automated machine learning that can  “automatically discover complete machine learning algorithms just using basic mathematical operations as building blocks.”

Another major theme of this conversation is semi-supervised learning, discussing his work on the paper “Self-training with Noisy Student improves ImageNet classification.” Finally, we discuss how his interest in sequence to sequence learning, and a chance encounter, led to the development of Meena, Google’s recent multi-turn conversational chatbot. 

This was a really fun conversation, so much so that we decided to release the video! April 16th at 12 pm PT, Quoc and Sam will premiere the video version of this interview, and answer your questions in the chat. We’ll see you there!

The complete show notes for this episode can be found at twimlai.com/talk/366.

Apr 16, 2020
Automating Electronic Circuit Design with Deep RL w/ Karim Beguir - #365
35:23

Today we’re joined by return guest Karim Beguir, Co-Founder and CEO of InstaDeep. We originally spoke with Karim about InstaDeep’s work back on episode 302, check that episode out for a full brief of Karim’s background.

In today’s conversation, we chat with Karim about InstaDeep’s new offering, DeepPCB, an end-to-end platform for automated circuit board design. We discuss challenges and problems with some of the original iterations of auto-routers, how Karim defines circuit board “complexity,” the differences between reinforcement learning being used for games and in this use case, and their spotlight paper from NeurIPS, co-authored with a team from Deepmind. 

Check out the complete show notes at twimlai.com/talk/365.

Apr 13, 2020
Neural Ordinary Differential Equations with David Duvenaud - #364
48:49

Today we’re joined by David Duvenaud, Assistant Professor at the University of Toronto. David, who joined us back on episode #96 back in January ‘18, is back to talk about the various papers that have come out of his lab over the last year and change, focused on Neural Ordinary Differential Equations, a type of continuous-depth neural network.

In our conversation, we talk through quite a few of David’s papers on the topic, which you can find below on the show notes page. We discuss the problem that David is trying to solve with this research, the potential that ODEs have to replace “the backbone” of the neural networks that are used to train today, and David’s approach to engineering. 

The complete show notes for this episode can be found at twimlai.com/talk/364.

Apr 09, 2020
The Measure and Mismeasure of Fairness with Sharad Goel - #363
47:33

Today we’re joined by Sharad Goel, Assistant Professor in the management science & engineering department at Stanford. Sharad, who also has appointments in the computer science, sociology, and law departments, has spent the recent years focused on applying machine learning to better understand and improve public policy. 

In our conversation, we dive into Sharad’s non-traditional path to academia, which includes extensive work on discriminatory policing, including practices like stop-and-frisk, leading up to his work on The Stanford Open Policing Project, which uses data from over 200 million traffic stops nationwide to “help researchers, journalists, and policymakers investigate and improve interactions between police and the public.” Finally, we discuss Sharad’s paper “The Measure and Mismeasure of Fairness: A Critical Review of Fair Machine Learning,” which identifies three formal definitions of fairness in algorithms, the statistical limitations of each, and details how mathematical formalizations of fairness could be introduced into algorithms.

Check out the complete show notes for this episode at twimlai.com/talk/363.

Apr 06, 2020
Simulating the Future of Traffic with RL w/ Cathy Wu - #362
34:16

Today we’re joined by Cathy Wu, Gilbert W. Winslow Career Development Assistant Professor in the department of Civil and Environmental Engineering at MIT. We had the pleasure of catching up with Cathy at NeurIPS to discuss her talk “Mixed Autonomy Traffic: A Reinforcement Learning Perspective.” 

In our conversation, we discuss Cathy’s transition to applying machine learning to civil engineering, specifically, understanding the potential impact autonomous vehicles would have on traffic once deployed. To better understand this, Cathy built multiple reinforcement learning simulations, including a track and intersection scenarios. We talk through how each scenario is set up, how human drivers are modeled for this simulation, and the results of the experiments.

Check out the complete show notes for this episode at twimlai.com/talk/362.

Apr 02, 2020
Consciousness and COVID-19 with Yoshua Bengio - #361
48:19

Today we’re joined by one of, if not the most cited computer scientist in the world, Yoshua Bengio. Yoshua is a Professor in the Department of Computer Science and Operations Research at the University of Montreal and the Founder and Scientific Director of MILA. We caught up with Yoshua just a few weeks into the coronavirus pandemic, so we spend a bit of time discussing his work both broadly on the impact of AI in society, as well as his current endeavor in building a COVID-19 tracing application, and the use of ML to propose experimental candidate drugs.

We also explore his work on consciousness, including how Yoshua defines consciousness, his paper “The Consciousness Prior,” the relationship between consciousness and intelligence, how attention could be used to train consciousness, the current state of consciousness research, and how he sees it evolving. 

Check out the complete show notes page at twimlai.com/talk/361.

Mar 30, 2020
Geometry-Aware Neural Rendering with Josh Tobin - #360
24:58

Today we’re joined by Josh Tobin, Co-Organizer of the machine learning training program Full Stack Deep Learning, and more recently, the founder of a stealth startup. We had the pleasure of sitting down with Josh prior to his presentation of his paper Geometry-Aware Neural Rendering at NeurIPS.

This work looks to build upon DeepMind’s “Neural scene representation and rendering,” with the goal of developing implicit scene understanding. We discuss challenges, the various datasets used to train his model, and the similarities between variational autoencoder training and his process. 

The complete show notes for this episode can be found at twimlai.com/talk/360.

Mar 26, 2020
The Third Wave of Robotic Learning with Ken Goldberg - #359
01:00:37

Today we’re joined by Ken Goldberg, professor of engineering and William S. Floyd Jr. distinguished chair in engineering at UC Berkeley. Ken, who is also an accomplished artist, and collaborator on projects such as DexNet and The Telegarden, has recently been focusing on robotic learning for grasping.

In our conversation with Ken, we chat about some of the challenges that arise when working on robotic grasping, including uncertainty in perception, control, and physics. We also discuss his view on the role of physics in robotic learning, citing co-contributors Sergey Levine and Pieter Abbeel along the way. Finally, we discuss some of his thoughts on potential robot use cases, from the use of robots in assisting in telemedicine, and agriculture, and even robotic Covid-19 testing.

The complete show notes for this episode can be found at twimlai.com/talk/359.

Mar 23, 2020
Learning Visiolinguistic Representations with ViLBERT w/ Stefan Lee - #358
27:36

Today we’re joined by Stefan Lee, assistant professor at the school of electrical engineering and computer science at Oregon State University. Stefan, who we sat down with at NeurIPS this past winter, is focused on the development of agents that can perceive their environment and communicate their understanding with humans in order to coordinate their actions to achieve mutual goals. 

In our conversation, we focus on his paper ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for Vision-and-Language Tasks, a model for learning joint representations of image content and natural language. We talk through the development and training process for this model, the adaptation of the training process to incorporate additional visual information to BERT models, where this research leads from the perspective of integration between visual and language tasks and finally, we discuss the importance of visual grounding.

Check out the complete show notes page at twimlai.com/talk/358.

Mar 18, 2020
Upside-Down Reinforcement Learning with Jürgen Schmidhuber - #357
33:19

Today we’re joined by Jürgen Schmidhuber, Co-Founder and Chief Scientist of NNAISENSE, the Scientific Director at IDSIA, as well as a Professor of AI at USI and SUPSI in Switzerland.

Jürgen’s lab is well known for creating the Long Short-Term Memory (LSTM) network which has become a prevalent neural network, used commonly devices such as smartphones, which we discuss in detail in our first conversation with Jürgen back in 2017.

In this conversation, we dive into some of Jürgen’s more recent work, including his recent paper, Reinforcement Learning Upside Down: Don’t Predict Rewards — Just Map Them to Actions.

Check out the show notes page at twimlai.com/talk/357.

Mar 16, 2020
SLIDE: Smart Algorithms over Hardware Acceleration for Large-Scale Deep Learning with Beidi Chen - #356
31:21
Today we're joined by Beidi Chen, PhD student at Rice University. Beidi is part of the team that developed a cheaper, algorithmic, CPU alternative to state-of-the-art GPU machines. They presented their findings at NeurIPS 2019 and have since gained a lot of attention for their paper, SLIDE: In Defense of Smart Algorithms Over Hardware Acceleration for Large-Scale Deep Learning Systems. In this interview, Beidi shares how the team took a new look at deep learning with the case of extreme classification by turning it into a search problem and using locality-sensitive hashing.
 
Check out the complete show notes at twimlai.com/talk/356. 
Mar 12, 2020
Advancements in Machine Learning with Sergey Levine - #355
42:13

Today we're joined by Sergey Levine, an Assistant Professor in the Department of Electrical Engineering and Computer Science at UC Berkeley. We last heard from Sergey back in 2017, where we explored Deep Robotic Learning. We caught up with Sergey at NeurIPS 2019, where Sergey and his team presented 12 different papers -- which means a lot of ground to cover!

Sergey and his lab’s recent efforts have been focused on contributing to a future where machines can be “out there in the real world, learning continuously through their own experience.” Sergey shares how many of the papers presented at the most recent NeurIPS conference are working to make that happen. Some of the major developments have been in the research fields of model-free reinforcement learning, causality and imitation learning, and offline reinforcement learning.

Check out the complete show notes page at twimlai.com/talk/355.

Mar 09, 2020
Secrets of a Kaggle Grandmaster with David Odaibo - #354
41:15

Imagine spending years learning ML from the ground up, from its theoretical foundations, but still feeling like you didn’t really know how to apply it. That’s where David Odaibo found himself in 2015, after the second year of his PhD. David’s solution was Kaggle, a popular platform for data science competitions.

Fast forward four years, and David is now a Kaggle Grandmaster, the highest designation, with particular accomplishment in computer vision competitions. Having completed his degree last year, he is currently co-founder and CTO of Analytical AI, a company that grew out of one of his recent Kaggle successes.

David has a background in deep learning and medical imaging–something he shares with his brother, Stephen Odaibo, who we interviewed last year about his work in Retinal Image Generation for Disease Discovery.

Check out the full article and interview at twimlai.com/talk/354

Mar 05, 2020
NLP for Mapping Physics Research with Matteo Chinazzi - #353
34:12

Predicting the future of science, particularly physics, is the task that Matteo Chinazzi, an associate research scientist at Northeastern University focused on in his paper Mapping the Physics Research Space: a Machine Learning Approach, along with co-authors including former TWIML AI Podcast guest Bruno Gonçalves.

In addition to predicting the trajectory of physics research, Matteo is also active in the computational epidemiology field. His work in that area involves building simulators that can model the spread of diseases like Zika or the seasonal flu at a global scale. 

Check out our full article on this episode at twimlai.com/talk/353.

Mar 02, 2020
Metric Elicitation and Robust Distributed Learning with Sanmi Koyejo - #352
55:11

The unfortunate reality is that many of the most commonly used machine learning metrics don't account for the complex trade-offs that come with real-world decision making. This is one of the challenges that today’s guest, Sanmi Koyejo has dedicated his research to address.

Sanmi is an assistant professor at the Department of Computer Science at the University of Illinois, where he applies his background in cognitive science, probabilistic modeling, and Bayesian inference to pursue his research which focuses broadly on “adaptive and robust machine learning.”

Check out the full episode write-up at twimlai.com/talk/352.

Feb 27, 2020
High-Dimensional Robust Statistics with Ilias Diakonikolas - #351
34:48

Today we’re joined by Ilias Diakonikolas, faculty in the CS department at the University of Wisconsin-Madison, and author of the paper Distribution-Independent PAC Learning of Halfspaces with Massart Noise, which was the recipient of the NeurIPS 2019 Outstanding Paper award. The paper, which focuses on high-dimensional robust learning, is regarded as the first progress made around distribution-independent learning with noise since the 80s. In our conversation, we explore robustness in machine learning, problems with corrupt data in high-dimensional settings, and of course, a deep dive into the paper. 

Check out our full write up on the paper and the interview at twimlai.com/talk/351.

Feb 24, 2020
How AI Predicted the Coronavirus Outbreak with Kamran Khan - #350
50:05

Today we’re joined by Kamran Khan, founder & CEO of BlueDot, and professor of medicine and public health at the University of Toronto. BlueDot, a digital health company with a focus on surveilling global infectious disease outbreaks, has been the recipient of a lot of attention for being the first to publicly warn about the coronavirus that started in Wuhan. How did the company’s system of algorithms and data processing techniques help flag the potential dangers of the disease? In this interview, Kamran talks us through how the technology works, its limits, and the motivation behind the work. 

Check out our new and improved show notes article at twimlai.com/talk/350.

Feb 19, 2020
Turning Ideas into ML Powered Products with Emmanuel Ameisen - #349
42:53

Today we’re joined by Emmanuel Ameisen, machine learning engineer at Stripe, and author of the recently published book “Building Machine Learning Powered Applications; Going from Idea to Product.” In our conversation, we discuss structuring end-to-end machine learning projects, debugging and explainability in the context of models, the various types of models covered in the book, and the importance of post-deployment monitoring. 

Check out our full show notes article at twimlai.com/talk/349.

Feb 17, 2020
Algorithmic Injustices and Relational Ethics with Abeba Birhane - #348
41:19

Today we’re joined by Abeba Birhane, PhD Student at University College Dublin and author of the recent paper Algorithmic Injustices: Towards a Relational Ethics. We caught up with Abeba, whose aforementioned paper was the recipient of the Best Paper award at the most recent Black in AI Workshop at NeurIPS, to go in-depth on the paper and the thought process around AI ethics.

In our conversation, we discuss the “harm of categorization”, and how the thinking around these categorizations should be discussed, how ML generally doesn’t account for the ethics of various scenarios and how relational ethics could solve this issue, her most recent paper “Robot Rights? Let’s Talk about Human Welfare Instead,” and much more.

Check out our complete write-up and resource page at twimlai.com/talk/348. 

Feb 13, 2020
AI for Agriculture and Global Food Security with Nemo Semret - #347
01:06:38

Today we’re excited to kick off our annual Black in AI Series joined by Nemo Semret, CTO of Gro Intelligence. Gro provides an agricultural data platform dedicated to improving global food security, focused on applying AI at macro scale. In our conversation with Nemo, we discuss Gro’s approach to data acquisition, how they apply machine learning to various problems, and their approach to modeling. 

 

Check out the full interview and show notes at twimlai.com/talk/347.

Feb 10, 2020
Practical Differential Privacy at LinkedIn with Ryan Rogers - #346
33:31

Today we’re joined by Ryan Rogers, Senior Software Engineer at LinkedIn. We caught up with Ryan at NeurIPS, where he presented the paper “Practical Differentially Private Top-k Selection with Pay-what-you-get Composition” as a spotlight talk. In our conversation, we discuss how LinkedIn allows its data scientists to access aggregate user data for exploratory analytics while maintaining its users’ privacy with differential privacy, and the major components of the paper. We also talk through one of the big innovations in the paper, which is discovering the connection between a common algorithm for implementing differential privacy, the exponential mechanism, and Gumbel noise, which is commonly used in machine learning.

 

The complete show notes for this episode can be found at twimlai.com/talk/346

Feb 07, 2020
Networking Optimizations for Multi-Node Deep Learning on Kubernetes with Erez Cohen - #345
34:00

Today we conclude our KubeCon ‘19 Series joined by Erez Cohen, VP of CloudX & AI at Mellanox. In our conversation, we discuss:

  • Erez’s talk “Networking Optimizations for Multi-Node Deep Learning on Kubernetes.” where he discusses problems and solutions related to networking discovered during the journey to reduce training time. 
  • NVIDIA’s recent acquisition of Mellanox, and what fruits that relationship hopes to bear. 
  • The evolution of technologies like RDMA, GPU Direct, and Sharp, Mellanox’s solution to improve the performance of MPI operations, which can be found in NVIDIA’s NCCL collective communications library.
  • How Mellanox is enabling Kubernetes and other platforms to take advantage of the various technologies mentioned above. 
  • Why we should care about networking in Deep Learning, which is inherently a compute-bound process. 

The complete show notes for this episode can be found at twimlai.com/talk/345.

Feb 05, 2020
Managing Research Needs at the University of Michigan using Kubernetes w/ Bob Killen - #344
24:40

Today we’re joined by Bob Killen, Research Cloud Administrator at the University of Michigan. In our conversation, we discuss:

  • How his group is deploying Kubernetes at UM.
  • The user experience of his broad user base, including those using KubeFlow environments.
  • How users are taking advantage of distributed computing.
  • Should ML/AI focused Kubernetes users should fear that the larger non-ML/AI user base will negatively impact their feature needs?
  • Where do the largest gaps currently exist in trying to support ML/AI users’ workloads?
  • Where Bob sees things going from a user perspective, and what are the things those users are asking about most? 

The complete show notes for this episode can be found at twimlai.com/talk/344.

Feb 03, 2020
Scalable and Maintainable Workflows at Lyft with Flyte w/ Haytham AbuelFutuh and Ketan Umare - #343
45:24

Today we kick off our KubeCon ‘19 series joined by Haytham AbuelFutuh and Ketan Umare, a pair of software engineers at Lyft. In our conversation, we discuss: 

  • Their newly open-sourced, cloud-native ML and data processing platform, Flyte.
  • What prompted Ketan to undertake this project and his experience building Flyte.
  • The core value proposition of Flyte.
  • What type-systems mean for the user experience.
  • How Flyte relates to Kubeflow. 
  • How Flyte is used across Lyft.

The complete show notes for this episode can be found at twimlai.com/talk/343

Jan 30, 2020
Causality 101 with Robert Osazuwa Ness - #342
43:14

Today we’re accompanied by Robert Osazuwa Ness, Machine Learning Research Engineer at ML Startup Gamalon and Instructor at Northeastern University. Robert, who we had the pleasure of meeting at the Black in AI Workshop at NeurIPS last month, joins us to discuss:

  • Causality, what it means, and how that meaning changes across domains and users.
  • Benefits of causal models vs non-causal models.
  • Real-world applications of causality. 
  • Various tools and packages for causality, 
  • Areas where it is effectively being deployed, like ML in production.
  • Our upcoming study group based around his new course sequence, “Causal Modeling in Machine Learning,” for which you can find details at twimlai.com/community.

The complete show notes for this episode can be found at twimlai.com/talk/342.

Jan 27, 2020
PaccMann^RL: Designing Anticancer Drugs with Reinforcement Learning w/ Jannis Born - #341
43:13

Today we’re joined by Jannis Born, Ph.D. student at ETH & IBM Research Zurich. We caught up with Jannis a few weeks back at NeurIPS, to discuss: 

  • His research paper “PaccMannRL: Designing anticancer drugs from transcriptomic data via reinforcement learning,” a framework built to accelerate new anticancer drug discovery. 
  • How his background in cognitive science and computational neuroscience applies to his current ML research.
  • How reinforcement learning fits into the goal of cancer drug discovery, and how deep learning has changed this research.
  • Jannis describes a few interesting observations made during the training of their DRL learner. 
  • And of course, Jannis offers us a step-by-step walkthrough of how the framework works to predict the sensitivity of cancer drugs on a cell and subsequently discover new anticancer drugs. 

Check out the complete show notes for this episode at twimlai.com/talk/341

Jan 23, 2020
Social Intelligence with Blaise Aguera y Arcas - #340
46:56

Today we’re joined by Blaise Aguera y Arcas, a distinguished scientist at Google. We had the pleasure of catching up with Blaise at NeurIPS last month, where he was invited to speak on “Social Intelligence.” In our conversation, we discuss:

  • Blaise’s role at Google, where he leads the Cerebra team. 
  • Their approach to machine learning at the company, and how they differ from the more forward-facing Google Brain team. 
  • Blaise gives us a look into his presentation, discussing today’s ML landscape.
  • The gap between AI and ML/DS research, what it means and why it exists.
  • The difference between intelligent systems and what we would deem to be “actual intelligence.” 
  • What does optimizing truly mean when training models?

Check out the complete show notes for this episode at twimlai.com/talk/340.

Jan 20, 2020
Music & AI Plus a Geometric Perspective on Reinforcement Learning with Pablo Samuel Castro - #339
43:49

Today we’re joined by Pablo Samuel Castro, Staff Research Software Developer at Google. Pablo, whose research is mainly focused on reinforcement learning, and I caught up at NeurIPS last month. We cover a lot of ground in our conversation, including his love for music, and how that has guided his work on the Lyric AI project, and a few of his other NeurIPS submissions, including “A Geometric Perspective on Optimal Representations for Reinforcement Learning” and “Estimating Policy Functions in Payments Systems using Deep Reinforcement Learning.” 

Check out the complete show notes at twimlai.com/talk/339.

Jan 16, 2020
Trends in Computer Vision with Amir Zamir - #338
01:30:18

Today we close out AI Rewind 2019 joined by Amir Zamir, who recently began his tenure as an Assistant Professor of Computer Science at the Swiss Federal Institute of Technology.

Amir joined us back in 2018 to discuss his CVPR Best Paper winner, and in today’s conversation, we continue with the thread of Computer Vision. In our conversation, we discuss quite a few topics, including Vision-for-Robotics, the expansion of the field of 3D Vision, Self-Supervised Learning for CV Tasks, and much more! Check out the rest of the series at twimlai.com/rewind19.

The complete show notes for this episode can be found at twimlai.com/talk/338.

 

Jan 13, 2020
Trends in Natural Language Processing with Nasrin Mostafazadeh - #337
01:12:17

Today we continue the AI Rewind 2019 joined by friend-of-the-show Nasrin Mostafazadeh, Senior AI Research Scientist at Elemental Cognition. We caught up with Nasrin to discuss the latest and greatest developments and trends in Natural Language Processing, including Interpretability, Ethics, and Bias in NLP, how large pre-trained models have transformed NLP research, and top tools and frameworks in the space.

The complete show notes can be found at twimlai.com/talk/337

Check out the rest of the series at twimlai.com/rewind19!

Jan 09, 2020
Trends in Fairness and AI Ethics with Timnit Gebru - #336
49:45

Today we keep the 2019 AI Rewind series rolling with friend-of-the-show Timnit Gebru, a research scientist on the Ethical AI team at Google. A few weeks ago at NeurIPS, Timnit joined us to discuss the ethics and fairness landscape in 2019. In our conversation, we discuss diversification of NeurIPS, with groups like Black in AI, WiML and others taking huge steps forward, trends in the fairness community, quite a few papers, and much more.

We want to hear from you! Send your thoughts on the year that was 2019 below in the comments, or via twitter @samcharrington or @twimlai.

The complete show notes for this episode can be found at twimlai.com/talk/336.

Check out the rest of the series at twimlai.com/rewind19!

Jan 06, 2020
Trends in Reinforcement Learning with Chelsea Finn - #335
01:06:57

Today we continue to review the year that was 2019 via our AI Rewind series, and do so with friend of the show Chelsea Finn, Assistant Professor in the Computer Science Department at Stanford University. Chelsea’s research focuses on Reinforcement Learning, so we couldn’t think of a better person to join us to discuss the topic. In this conversation, we cover topics like Model-based RL, solving hard exploration problems, along with RL libraries and environments that Chelsea thought moved the needle last year. 

We want to hear from you! Send your thoughts on the year that was 2019 below in the comments, or via twitter @samcharrington or @twimlai.

The complete show notes for this episode can be found at twimlai.com/talk/335.

Check out the rest of the series at twimlai.com/rewind19!

Jan 02, 2020
Trends in Machine Learning & Deep Learning with Zack Lipton - #334
01:19:42

Today we kick off our 2019 AI Rewind Series joined by Zack Lipton, a jointly appointed Professor in the Tepper School of Business and the Machine Learning Department at CMU.

You might remember Zack from our conversation earlier this year, “Fairwashing” and the Folly of ML Solutionism, which you can find at twimlai.com/talk/285. In our conversation, Zack recaps advancements across the vast fields of Machine Learning and Deep Learning, including trends, tools, research papers and more.

We want to hear from you! Send your thoughts on the year that was 2019 below in the comments, or via Twitter @samcharrington or @twimlai.

To get the complete show notes for this episode, head over to twimlai.com/talk/334. 

 

Dec 30, 2019
FaciesNet & Machine Learning Applications in Energy with Mohamed Sidahmed - #333
40:31

Today we close out our 2019 NeurIPS series with Mohamed Sidahmed, Machine Learning and Artificial Intelligence R&D Manager at Shell. In our conversation, we discuss: 

  • The papers Mohamed and his team submitted to the conference this year, in particular: 
    • Accelerating Least Squares Imaging Using Deep Learning Techniques, which details how researchers can computationally efficiently reconstruct imaging using a deep learning framework approach.

    • FaciesNet: Machine Learning Applications for Facies Classification in Well Logs, which Mohamed describes as “A novel way of designing a new architecture for how we use sequence modeling and recurrent networks to be able to break out of the benchmark for classifying the different types of rock.” 

The full show notes for this episode can be found at twimlai.com/talk/333. Make sure you head over to twimlai.com/neurips2019 to follow along with this series!

Dec 27, 2019
Machine Learning: A New Approach to Drug Discovery with Daphne Koller - #332
43:40

Today we continue our 2019 NeurIPS coverage joined by Daphne Koller, co-Founder and former co-CEO of Coursera and Founder and CEO of Insitro. We caught up with Daphne to discuss: 

  • Her background in machine learning, beginning in ‘93, and her work with the Stanford online machine learning courses, and eventually her work at Coursera.
  • The current landscape of pharmaceutical drug discovery, including the current pricing of drugs and misnomers with why drugs are so expensive, 
  • Her work at Insitro, a company looking to advance drug discovery and development with machine learning. 
  • An overview of Insitro’s goal of using ML as a “compass” in drug discovery. 
  • How Insitro functions as a company in this space, including their focus on the biology of drug discovery and the landscape of ML techniques being used
  • Daphne’s thoughts on AutoML, and much more!

The full show notes for this episode can be found at twimlai.com/talk/332. Make sure you head over to twimlai.com/neurips2019 to follow along with this series!

Dec 26, 2019
Sensory Prediction Error Signals in the Neocortex with Blake Richards - #331
41:05

Today we continue our 2019 NeurIPS coverage, this time around joined by Blake Richards, Assistant Professor at McGill University and a Core Faculty Member at Mila. In our conversation, we discuss:

  • His invited talk at the Neuro-AI Workshop “Sensory Prediction Error Signals in the Neocortex.” 
  • His recent studies on two-photon calcium imaging, predictive coding, and hierarchical inference.
  • Blake’s recent work on memory systems for reinforcement learning. 

The complete show notes for this episode can be found at twimlai.com/talk/331.

Make sure you head over to twimlai.com/neurips2019 to follow along with this series!

Dec 24, 2019
How to Know with Celeste Kidd - #330
54:03

Today we begin our coverage of the 2019 NeurIPS conference with Celeste Kidd, Assistant Professor of Psychology at UC Berkeley. In our conversation, we discuss:

  • The research at the Kidd Lab, which is focused on understanding “how people come to know what they know.”
  • Her invited talk “How to Know,” which details the core cognitive systems people use to guide their learning about the world.
  • Why people are curious about some things but not others.
  • How our past experiences and existing knowledge shape our future interests.
  • Why people believe what they believe, and how these beliefs are influenced in one direction or another.
  • How machine learning figures into this equation.

Check out the complete show notes for this episode at twimlai.com/talk/330. You can also follow along with this series at twimlai.com/neurips2019.

Dec 23, 2019
Using Deep Learning to Predict Wildfires with Feng Yan - #329
49:49

Today we’re joined by Feng Yan, Assistant Professor at the University of Nevada, Reno. In our conversation, we discuss:

  • ALERTWildfire, a camera-based network infrastructure that captures satellite imagery of wildfires.
  • The many purposes of ALERTWildfire, including the discovery of wildfires, the ability to scale resources accordingly, and a few others
  • The development of the machine learning models and surrounding infrastructure used in ALERTWildfire. 
  • Problem formulation and challenges with using camera and satellite data in this use case.
  • How they have combined the use of Infra-as-a-Service and Function-as-a-Service tools for cost-effectiveness and scalability. 

Check out the complete show notes at twimlai.com/talk/329.

Dec 20, 2019
Advancing Machine Learning at Capital One with Dave Castillo - #328
33:26

Today we’re joined by Dave Castillo, Managing Vice President for ML at Capital One and head of their Center for Machine Learning. We caught up with David at re:Invent to discuss the aforementioned Center for Machine Learning, and what has changed since our last discussing with Capital One, which you can find at twimlai.com/talk/147. In our conversation we explore:

  • Capital One’s transition from “lab-based” machine learning to “enterprise-wide” adoption and support of ML.
  • Surprising machine learning use cases like granting employee access privileges via an automated system.
  • Their current platform ecosystem, including their design vision in building this into a larger, all-encompassing platform, pain points in building this platform, and more. 

Check out the complete show notes for this episode at twimlai.com/talk/328.

Dec 19, 2019
Helping Fish Farmers Feed the World with Deep Learning w/ Bryton Shang - #327
38:06

Today we’re joined by Bryton Shang, Founder & CEO at Aquabyte. We caught up with Bryton after his talk at re:Invent’s ML Summit to discuss:

  • Aquabyte, a company focused on the application of computer vision fish farming.
  • How Bryton identified the various problems associated with mass fish farming and how he eventually moved to Norway to develop the solution.
  • The challenges with developing machine learning solutions that can measure the height and weight of fish,
  • How they use computer vision algorithms to asses issues like sea lice, which can be up to 25% of the cost associated with running farms.
  • Cool new features currently in the works like facial recognition for fish!

The complete show notes for this episode can be found at twimlai.com/talk/327.

Dec 17, 2019
Metaflow, a Human-Centric Framework for Data Science with Ville Tuulos - #326
56:17

Today we kick off our re:Invent 2019 series with Ville Tuulos, Machine Learning Infrastructure Manager at Netflix. At re:Invent, Netflix announced the open-sourcing of Metaflow, their “human-centric framework for data science.” In our conversation, we discuss all things Metaflow, including:

  • The problem Metaflow is trying to solve
  • Why it was important for Netflix to open-source Metaflow
  • Core Features
  • The user experience accessing and managing data, experimentation, training and model development
  • The various supported tools and libraries


If you’re interested in checking out a Metaflow democast with Villa, reach out at twimlai.com/contact! 

Dec 13, 2019
Single Headed Attention RNN: Stop Thinking With Your Head with Stephen Merity - #325
59:04

Today we’re joined by Stephen Merity, startup founder and independent researcher, with  a focus on NLP and Deep Learning. In our conversation, we discuss:

  • Stephen’s newest paper, Single Headed Attention RNN: Stop Thinking With Your Head.
  • His motivations behind writing the paper; the fact that NLP research has been recently dominated by the use of transformer models, and the fact that these models are not the most accessible/trainable for broad use.
  • The architecture of transformers models.
  • How Stephen decided to use SHA-RNNs for this research.
  • How Stephen built and trained the model, for which the code is available on Github.
  • His approach to benchmarking this project.
  • Stephen’s goals for this research in the broader NLP research community. 

The complete show notes for this episode can be found at twimlai.com/talk/325. There you’ll find links to both the paper referenced in this interview, and the code. Enjoy!

Dec 12, 2019
Automated Model Tuning with SigOpt - #324
46:10

In this TWIML Democast, we're joined by SigOpt Co-Founder and CEO Scott Clark. Scott details the SigOpt platform, and gives us a live demo!

This episode is best consumed by watching the corresponding video demo, which you can find at twimlai.com/talk/324

 

 

Dec 09, 2019
Automated Machine Learning with Erez Barak - #323
43:25

In the final episode of our Azure ML series, we’re joined by Erez Barak, Partner Group Manager of Azure ML at Microsoft. In our conversation, we discuss:

  • Erez’s AutoML philosophy, including how he defines “true AutoML” and his take on the AutoML space, its role and its importance.
  • We also discuss in great detail the application of AutoML as a contributor to the end-to-end data science process, which Erez breaks down into 3 key areas; Featurization, Learner/Model Selection, and Tuning/Optimizing Hyperparameters.
  • Finally, we discuss post-deployment AutoML use cases and other areas under the AutoML umbrella that are currently generating excitement.

Check out the complete show notes at twimlai.com/talk/323!

Dec 06, 2019
Responsible AI in Practice with Sarah Bird - #322
38:41

Today we continue our Azure ML at Microsoft Ignite series joined by Sarah Bird, Principal Program Manager at Microsoft. In our conversation, we discuss:

  • Sarah’s work in machine learning systems, with a focus on bringing machine learning research into production through Azure ML, with an emphasis on responsible AI.

  • A set of newly released tools focused on responsible machine learning, Azure Machine Learning 'Machine Learning Interpretability Toolkit’
  • Moving from “Black-Box” models to “Glass-Box Models”
  • Sarah’s recent work in differential privacy, including risks and benefits
  • Her work in the broader ML community, including being a founding member of the MLSys conference and workshops.

Check out the complete show notes at twimlai.com/talk/322.

Dec 04, 2019
Enterprise Readiness, MLOps and Lifecycle Management with Jordan Edwards - #321
39:43

Today we’re joined by Jordan Edwards, Principal Program Manager for MLOps on Azure ML at Microsoft. In our conversation, Jordan details:

  • How Azure ML accelerates model lifecycle management with MLOps, enabling data scientists to collaborate with IT teams to increase the pace of model development and deployment.
  • Problems associated with generalizing ML at scale at Microsoft, and how those problems are prioritized, 
  • What is MLOps, and the role of testing is in an MLOps environment, and experiences working with customers to implement these tests. 
  • The “four phases” along the journey of customer implementation of MLOps, how companies should look at hiring ML Engineers vs DevOps Engineers, and other aspects of managing model life cycles that Jordan finds important for us to think about. 

The complete show notes can be found at twimlai.com/talk/321. 

Dec 02, 2019
DevOps for ML with Dotscience - #320
47:04

Today we’re joined by Luke Marsden, Founder and CEO of Dotscience. Luke walks us through the Dotscience platform and their manifesto on DevOps for ML.

Thanks to Luke and Dotscience for their sponsorship of this Democast and their continued support of TWIML.  

Head to https://twimlai.com/democast/dotscience to watch the full democast!

Nov 26, 2019
Building an Autonomous Knowledge Graph with Mike Tung - #319
44:47

Today we’re joined by Mike Tung, Founder, and CEO of Diffbot. In our conversation, we discuss: 

  • Their various tools, including their Knowledge Graph, Extraction API, and CrawlBot.
  • How Knowledge Graph was inspired by Imagenet, how it was built, and how it differs from other, more mainstream knowledge graphs like Google Search and MSFT Bing.
  • How they balance being a research company that is also commercially viable.
  • The developer experience with their tools, and challenges faced.

The complete show notes can be found at twimlai.com/talk/319.

Nov 21, 2019
The Next Generation of Self-Driving Engineers with Aaron Ma - Talk #318
47:53

Today we’re joined by our youngest guest ever (by far), Aaron Ma, an 11-year-old middle school student and machine learning engineer in training. Aaron has completed over 80(!) Coursera courses and is the recipient of 3 Udacity nano-degrees. In our conversation, we discuss:

  • Aaron’s research interests, reinforcement learning, and self-driving cars,
  • His experiences participating in over 35 kaggle competitions
  • How he balances his passion for machine learning with things like chores and homework.

This was a really fun interview! 

The complete show notes for this episode can be found at twimlai.com/talk/318.

Nov 18, 2019
Spiking Neural Networks: A Primer with Terrence Sejnowski - #317
49:34

On today’s episode, we’re joined by Terrence Sejnowski, Francis Crick Chair, head of the Computational Neurobiology Laboratory at the Salk Institute for Biological Studies and faculty member at UC San Diego. In our conversation with Terry, we discuss:

  • His role as a founding researcher in the field of computational neuroscience, and as a founder of the annual Telluride Neuromorphic Cognition Engineering Workshop. 
  • We dive deep into the world of spiking neural networks and brain architecture,
  • the relationship of neuroscience to machine learning, and ways to make NN’s more efficient through spiking. 
  • Terry also gives us some insight into hardware used in this field, characterizes the major research problems currently being undertaken, and the future of spiking networks. 

Check out the complete show notes at twimlai.com/talk/317.

 

Nov 14, 2019
Bridging the Patient-Physician Gap with ML and Expert Systems w/ Xavier Amatriain - #316
39:01

Today we’re joined by return guest Xavier Amatriain, Co-founder and CTO of Curai. In our conversation, we discuss

  • Curai’s goal of providing the world’s best primary care to patients via their smartphone, and how ML & AI will bring down costs healthcare accessible and scaleable. 
  • The shortcomings of traditional primary care, and how Curai fills that role, 
  • Some of the unique challenges his team faces in applying this use case in the healthcare space. 
  • Their use of expert systems, how they develop and train their models with synthetic data through noise injection
  • How NLP projects like BERT, Transformer, and GPT-2 fit into what Curai is building. 

Check out the complete show notes page at twimlai.com/talk/316

Nov 11, 2019
What Does it Mean for a Machine to "Understand"? with Thomas Dietterich - #315
38:09

Today we’re joined by Tom Dietterich, Distinguished Professor Emeritus at Oregon State University. We had the pleasure of discussing Tom’s recent blog post, “What does it mean for a machine to “understand,” in which he discusses:

  • Tom’s position on what qualifies as machine “understanding”, including a few examples of systems that he believes exhibit understanding.
  • The role of deep learning in achieving artificial general intelligence.
  • The current “Hype Engine” that exists around AI Research, and SOOO much more.  

Make sure you check out the show notes at twimlai.com/talk/315, where you’ll find links to Tom’s blog post, as well as a ton of other references. 

Nov 07, 2019
Scaling TensorFlow at LinkedIn with Jonathan Hung - #314
35:07

Today we’re joined by Jonathan Hung, Sr. Software Engineer at LinkedIn, who we caught up with at TensorFlow World last week. In our conversation, we discuss: 

  • Jonathan’s presentation at the event focused on LinkedIn’s efforts scaling Tensorflow.
  • Jonathan’s work as part of the Hadoop infrastructure team, including experimenting on Hadoop with various frameworks, and their motivation for using TensorFlow on their pre-existing Hadoop clusters infrastructure. 
  • TonY, or TensorFlow on Yard, LinkedIn’s framework that natively runs deep learning jobs on Hadoop, and its relationship with Pro-ML, LinkedIn’s internal AI Platform, which we’ve discussed on earlier episodes of the podcast (Link).
  • Finally, we discuss how far LinkedIn’s Hadoop infrastructure has come since 2017, and their foray into using Kubernetes for research. 

The complete show notes can be found at twimlai.com/talk/314.

Nov 04, 2019
Machine Learning at GitHub with Omoju Miller - #313
43:41

Today we’re joined by Omoju Miller, a Sr. machine learning engineer at GitHub. In our conversation, we discuss:

  • Her dissertation, Hiphopathy, A Socio-Curricular Study of Introductory Computer Science, 
  • Her work as an inaugural member of the Github machine learning team
  • Her two presentations at Tensorflow World, “Why is machine learning seeing exponential growth in its communities” and “Automating your developer workflow on GitHub with Tensorflow.”

The complete show notes for this episode can be found at twimlai.com/talk/313. 

Oct 31, 2019
Using AI to Diagnose and Treat Neurological Disorders with Archana Venkataraman - #312
47:48

Today we’re joined by Archana Venkataraman, John C. Malone Assistant Professor of Electrical and Computer Engineering at Johns Hopkins University, and MIT 35 innovators under 35 recipient.

Archana’s research at the Neural Systems Analysis Laboratory focuses on developing tools, frameworks, and algorithms to better understand, and treat neurological and psychiatric disorders, including autism, epilepsy, and others. In our conversation, we explore her lab’s work in applying machine learning to these problems, including biomarker discovery, disorder severity prediction, as well as some of the various techniques and frameworks used.

The complete show notes for this episode can be found at twimlai.com/talk/312.

Oct 28, 2019
Deep Learning for Earthquake Aftershock Patterns with Phoebe DeVries & Brendan Meade - #311
35:44

Today we are joined by Phoebe DeVries, Postdoctoral Fellow in the Department of Earth and Planetary Sciences at Harvard and assistant faculty at the University of Connecticut and Brendan Meade, Professor of Earth and Planetary Sciences and affiliate faculty in computers sciences at Harvard. In this episode, we discuss:

  • Phoebe and Brendan’s work is focused on discovering as much as possible about earthquakes before they happen, and through measuring how the earth’s surface moves, predicting future movement location
  • Their recent paper, ‘Deep learning of aftershock patterns following large earthquakes’, and 
  • The preliminary steps that guided them to using machine learning in the earth sciences
  • Their current research involving calculating stress changes in the crust and upper mantle after a large earthquake and using a neural network to map those changes to predict aftershock locations
  • The complex systems that encompass earth science studies, including the approaches, challenges, surprises, and results that come with incorporating machine learning models and data sets into a new field of study

The complete show notes for this episode can be found at twimlai.com/talk/311.

Oct 25, 2019
Live from TWIMLcon! Operationalizing Responsible AI - #310
30:33

An often forgotten about topic garnered high praise at TWIMLcon this month: operationalizing responsible and ethical AI. This important topic was combined with an impressive panel of speakers, including: Rachel Thomas, Director, Center for Applied Data Ethics at the USF Data Institute, Guillaume Saint-Jacques, Head of Computational Science at LinkedIn, and Parinaz Sobahni, Director of Machine Learning at Georgian Partners, moderated by Khari Johnson, Senior AI Staff Writer at VentureBeat. This episode covers:

  • The basics of operationalizing AI ethics in a range of orgs and insight into an array of tools, approaches, and methods that have been found useful for teams to use
  • The biggest concerns, like focusing more on harm as opposed to algorithmic bias and encouraging specific responsibility for systems
  • Educating the general public on the realities and misconceptions of probabilistic methods and putting into place preventative guardrails has become imperative for any operation
  • The long-term benefits of ethical decision-making and the challenges of established versus startup companies
  • Questions from the TWIMLcon audience, some common examples of power dynamics in AI ethics, and what we as a community can be doing to push the needle in the very powerful world of responsible AI

The complete show notes can be found at twimlai.com/talk/310

Oct 22, 2019
Live from TWIMLcon! Scaling ML in the Traditional Enterprise - #309
33:37

In this episode from a stellar TWIMLcon panel, the state and future of larger, more established brands is analyzed and discussed. Hear from Amr Awadallah, Founder and Global CTO of Cloudera, Pallav Agrawal, Director of Data Science at Levi Strauss & Co., and Jürgen Weichenberger, Data Science Senior Principal & Global AI Lead at Accenture, moderated by Josh Bloom Professor at UC Berkeley. In this episode we discuss:

  • For an ML/AI initiative to be successful, a conscious and noticeable shift is now required in how things used to be managed while educating cross-functional teams in data science terms and methodologies 
  • It can be tempting and exciting to constantly be trying out the latest technologies, but brand consistency and sustainability is imperative to success
  • How the real business value - the money - can be found by putting your big ML/AI goals and projects in the core competencies of the company.  
  • Are traditional enterprises fundamentally changing their business through ML/AI, and if so, why? 
  • Real-world examples and thought-provoking ideas for scaling ML/AI in the traditional enterprise

The complete show notes can be found at twimlai.com/talk/309.

Oct 18, 2019
Live from TWIMLcon! Culture & Organization for Effective ML at Scale (Panel) - #308
27:59

TWIMLcon brought together so many in the ML/AI community to discuss the unique challenges to building and scaling machine learning platforms. In this episode, hear from a diverse set of panelists including: Pardis Noorzad, Data Science Manager at Twitter, Eric Colson, Chief Algorithms Officer Emeritus at Stitch Fix, and Jennifer Prendki, Founder & CEO at Alectio, moderated by Maribel Lopez, Founder & Principal Analyst at Lopez Research:

  • How to approach changing the way companies think about machine learning
  • Engaging different groups to work together effectively - i.e. c-suite, marketing, sales, engineering, etc. 
  • The importance of clear communication about ML lifecycle management
  • How full stack roles can provide immense value
  • Tips and tricks to work faster, more efficiently, and create an org-wide culture that holds machine learning as a valued priority

The complete show notes can be found at twimlai.com/talk/308.

Oct 15, 2019
Live from TWIMLcon! Use-Case Driven ML Platforms with Franziska Bell - #307
32:14

Franziska Bell, Ph.D., is the Director of Data Science Platforms at Uber, and joined Sam on stage at TWIMLcon last week to discuss all things platform at Uber. With the goal of providing cutting edge data science company-wide at the push of a button, Fran has developed a portfolio of platforms, ranging from forecasting to anomaly detection to conversational AI. In this episode, we discuss:

  • Through strategic use cases, Fran’s team of data scientists works closely with teams across the organization at every stage to solve problems and build infrastructure
  • The evolving working relationship between her team and Michelangelo (Uber’s ML Platform), including the challenges and benefits that such a platform provides
  • Insight into Uber’s development methodology and how the data science team is organized from start to finish to create a culture of learning and expertise that results in fast results and reduced risk
  • Fran’s take on the future of ML platforms and more!

Check out the complete show notes at twimlai.com/talk/307

Oct 10, 2019
Live from TWIMLcon! Operationalizing ML at Scale with Hussein Mehanna - #306
33:39

The live interviews from TWIMLcon continue with Hussein Mehanna, Head of Machine Learning and Artificial Intelligence at Cruise. From his start at Facebook and then Google and now to Cruise, leading the trend of autonomous vehicles, Hussein has seen first hand what it takes to scale and sustain machine learning programs. In this episode, hear him and Sam discuss:

  • At Facebook, a few early wins in the realm of infrastructure building set the stage for scaling via faster algorithms and soon the entire Facebook organization could achieve a new level of ML scaling with all workflows shareable, reusable and discoverable through a search interface
  • Cruise’s unique focus on the interplay between applied research problems and the underlying tools and platforms
  • The immense capacity that the industry of autonomous vehicles has to push ML and AI to new limits of depth and scale
  • The challenges (and joys) of working in the industry and his insight into analyzing scale when innovation is happening in parallel with the development
  • Hussein’s experiences at Facebook, Google, and Cruise, along with his thoughts on productivity being a "usability" vs "modeling" challenge and his prediction for the future of ML platforms!

The complete show notes can be found at twimlai.com/talk/306.

Oct 08, 2019
Live from TWIMLcon! Encoding Company Culture in Applied AI Systems - #305
32:22

In this episode, Sam is joined by Deepak Agarwal, VP of Engineering at LinkedIn, who graced the stage at TWIMLcon: AI Platforms for a keynote interview. In this episode Deepak shares:

  • The incredible impact that standardizing processes and tools have on a company’s culture and overall productivity levels
  • Insight into the best way to increase ML ROI and how to sell ML programs to the C-Suite (two things that often go hand in hand)
  • The Pro-ML initiative for delivering machine learning systems at scale, specifically looking at aligning improvement of tooling and infrastructure with the pace of innovation and more!

Check out the complete show notes at twimlai.com/talk/305.

Oct 04, 2019
Live from TWIMLcon! Overcoming the Barriers to Deep Learning in Production with Andrew Ng - #304
33:59

Earlier today, Andrew Ng joined us onstage at TWIMLcon to share some of his immense knowledge. As the Founder and CEO of Landing AI, Co-Chairman and Co-Founder of Coursera, and founding lead of Google Brain, Andrew is no stranger to knowing what it takes for AI and machine learning to be successful.

In this episode, hear about:

  • The work that Landing AI is doing to help organizations adopt modern AI
  • His experiences in overcoming the challenges that large companies face
  • Insight into how enterprises can get the most value for their ML investment
  • The ‘essential complexity’ of software engineering and more! 

The complete show notes can be found at twimlai.com/talk/304.

Oct 01, 2019
The Future of Mixed-Autonomy Traffic with Alexandre Bayen - #303
43:44

Today we are joined by Alexandre Bayen, Director of the Institute for Transportation Studies and Professor at UC Berkeley.In this episode, we discuss Alex’s background in machine learning, his current research in mixed-autonomy traffic, and the idea of swarming in terms of the impact just a few self-driving cars can have on traffic mobility. In the AWS re:Invent conference last year, Alex presented on the future of mixed-autonomy traffic and the two major revolutions he predicts will take place in the next 10-15 years. This includes model-free deep reinforcement learning techniques and end-to-end pixel learning. Looking ahead, Alex shares his take on the future of transportation systems and the potential for varying levels of automation in sub-communities.

The complete show notes can be found at twimlai.com/talk/303.

Sep 27, 2019
Deep Reinforcement Learning for Logistics at Instadeep with Karim Beguir - #302
43:45

Today we are joined by Karim Beguir, Co-Founder and CEO of InstaDeep, a company in Tunisia, Africa focusing on building advanced decision-making systems for the enterprise. In this episode, we discuss where his and InstaDeep’s journey began in Tunisia, Africa (00:27), the challenges that enterprise companies are seeing in logistics that can be solved by deep learning and machine learning (05:43), how InstaDeep is applying DL and RL to real world problems (09:45), and what are the data sets used to train these models and the application of transfer learning between similar data sets (13:00). Additionally, we go over ‘Rank Rewards’, a paper Karim published last year, in which adversarial self-play in two-player games has delivered impressive results when used with reinforcement learning algorithms (22:40), the overall efficiency of RL for logistical problems (23:05), and details on the InstaDeep process (35:37).

The complete show notes for this episode can be found at twimlai.com/talk/302. 

Sep 25, 2019
Deep Learning with Structured Data w/ Mark Ryan - #301
39:30

Today we're joined by Mark Ryan, author of Deep Learning with Structured Data, currently in the Manning Early Access Program (MEAP), due for publication in Spring 2020. While working on the Support team at IBM Data and AI, he saw that there was a lack of general structured data sets that people could apply their models to. Using the streetcar network in his hometown of Toronto, Mark created a deep learning model to predict delays, but more importantly, gathered an open data set that was the perfect size and variety, and jump started the research for his latest book. In this episode, Mark shares the benefits of applying deep learning to structured data (and recent reduced barriers to entry), details of his experience with a range of data sets, the everlasting appreciation he and Sam shares for the Fast.ai course by Jeremy Howard, and the contents of his new book, aimed to help set up and maintain deep learning models with structured data.

With just two weeks left, time is running out for you to register for TWIMLcon: AI Platforms. Don't be left out! Register NOW at twimlcon.com/register

Sep 19, 2019
Time Series Clustering for Monitoring Fueling Infrastructure Performance with Kalai Ramea - #300
30:04

Today we're joined by Kalai Ramea, Data Scientist at PARC, a Xerox Company. With a background in transportation, energy efficiency, art, and machine learning, Kalai has been fortunate enough to follow her passions through her work. In this episode we discuss:

  • Her environmentally efficient pursuit that lead to the purchase of a hydrogen car, and the subsequent journey and paper that followed assessing fueling stations 
  • Kalai’s next paper, looking at fuel consumption at hydrogen stations using temporal clustering to identify signatures of usage over time, grouping the stations into categories 
  • With the construction of fueling stations is planned to increase dramatically in the next 5 years, building reliability on their performance is crucial
  • A sneak peek into how Kalai incorporates her love of art into her work!

Check out the show notes, and the refresh, at twimlai.com

Sep 18, 2019
Swarm AI for Event Outcome Prediction with Gregg Willcox - TWIML Talk #299
42:35

Today we're joined by Gregg Willcox, Director of Research and Development at Unanimous AI. Inspired by the natural phenomenon called 'swarming', which uses the collective intelligence of a group to produce more accurate results than an individual alone, ‘Swarm AI’ was born. A game-like platform that channels the convictions of individuals to come to a consensus and using a behavioral neural network trained on people’s behavior called ‘Conviction’, to further amplify the results. 

The complete show notes for this episode can be found at twimlai.com/talk/299.

We're just over two weeks out from TWIMLcon: AI Platforms! You definitely want to be there. Visit twimlcon.com for more info, or to register. 

Sep 13, 2019
Rebooting AI: What's Missing, What's Next with Gary Marcus - TWIML Talk #298
47:49

Today we're joined by Gary Marcus, CEO and Founder at Robust.AI, former CEO and Founder of Geometric Intelligence (acquired by Uber) and well-known scientist, bestselling author, professor and entrepreneur. In this episode hear Gary discuss:

  • His latest book, ‘Rebooting AI: Building Artificial Intelligence We Can Trust’, an extensive look into the current gaps, pitfalls and areas for improvement in the field of machine learning and AI 
  • A break down of the difference between reinforcement learning and real learning 
  • Why we need machines with both automation and autonomy to be truly usable in the world today 
  • Examples from his book, including Teslas driving into tow trucks and Microsoft’s SQuAD reading test results
  • Insight into what we should be talking and thinking about to make even greater (and safer) strides in AI

The complete show notes for this episode can be found at twimlai.com/talk/298.

Only 3 weeks left to register for TWIMLcon: AI Platforms! Visit twimlcon.com/register now!

 

Sep 10, 2019
DeepQB: Deep Learning to Quantify Quarterback Decision-Making with Brian Burke - TWIML Talk #297
51:15

Today we're joined by Brian Burke, Analytics Specialist with the Stats & Information Group at ESPN. A former Navy pilot and lifelong football fan, Brian saw the correlation between fighter pilots and quarterbacks in the quick, pressure-filled decisions both roles have to make on a regular basis. In this episode, we discuss:

  • Brian’s self-taught modeling techniques and his journey finding and handling vast amounts of sports data 
  • His findings in the paper, “DeepQB: Deep Learning with Player Tracking to Quantify Quarterback Decision-Making & Performance”
  • Brian talks through the making of his model, with geometry, algebra and a self-proclaimed ‘vanilla’ neural network
  • His excitement for the future of machine learning in sports and more!

The complete show notes for this episode can be found at twimlai.com/talk/297.

Sep 05, 2019
Measuring Performance Under Pressure Using ML with Lotte Bransen - TWIML Talk #296
34:57

Today we're joined by Lotte Bransen, Scientific Researcher at SciSports. With a background in mathematics, econometrics and soccer, Lotte has honed her research on analytics of the game and its players. More specifically, using trained models to understand the impact of mental pressure on a player’s performance. In this episode, Lotte discusses:

  • Her latest paper, ‘Choke or Shine? Quantifying Soccer Players' Abilities to Perform Under Mental Pressure’ and shares 
  • The basis of the models through two aspects of mental pressure: pre-game and in-game, and three performance metrics: the chance of a goal with every action a player takes (contribution), the quality of that decision and the quality of the execution
  • The implications of her research in the world of sports
  • Just a few of the exponential applications for her work - check it out!

Check out the full show notes at twimlai.com/talk/296.

Sep 03, 2019
Managing Deep Learning Experiments with Lukas Biewald - TWIML Talk #295
43:39

Today we're joined by Lukas Biewald, CEO and Co-Founder of Weights & Biases. Lukas, previously CEO and Founder of Figure Eight (CrowdFlower), has a straightforward goal: provide researchers with SaaS that is easy to install, simple to operate, and always accessible. Seeing a need for reproducibility in deep learning experiments, Lukas founded Weights & Biases. In this episode we discuss:

  • The experiment tracking tool, how it works, and the components that make it unique in the ML marketplace
  • The open, collaborative culture that Lukas promotes
  • How Lukas got his start in deep learning experiments, what his experiment tracking used to look like, 
  • The current Weights & Biases business success strategy and what his team is working on today

The complete show notes for this episode can be found at twimlai.com/talk/295

Thanks to our friends at Weights & Biases for their support of the show, their sponsorship of this episode, and our upcoming event, TWIMLcon: AI Platforms. 

Registration for TWIMLcon is still open! Visit twimlcon.com/register today! 

Aug 29, 2019
Re-Architecting Data Science at iRobot with Angela Bassa - TWIML Talk #294
49:27

Today we’re joined by Angela Bassa, Director of Data Science at iRobot. In our conversation, Angela and I discuss:

• iRobot's re-architecture, and a look at the evolution of iRobot.

• Where iRobot gets its data from and how they taxonomize data science.

• The platforms and processes that have been put into place to support delivering models in production.

•The role of DevOps in bringing these various platforms together, and much more!

The complete show notes can be found at twimlai.com/talk/294.

Check out the recently released speaker list for TWIMLcon: AI Platforms now! twimlcon.com/speakers.

Aug 26, 2019
Disentangled Representations & Google Research Football with Olivier Bachem - TWIML Talk #293
43:29

Today we’re joined by Olivier Bachem, a research scientist at Google AI on the Brain team.

Initially, Olivier joined us to discuss his work on Google’s research football project, their foray into building a novel reinforcement learning environment, but we spent a fair amount of time exploring his research in disentangled representations. Olivier and Sam also discuss what makes the football environment different than other available reinforcement learning environments like OpenAI Gym and PyGame, what other techniques they explored while using this environment, and what’s on the horizon for their team and Football RLE.

Check out the full show notes at twimlai.com/talk/293

Aug 22, 2019
Neural Network Quantization and Compression with Tijmen Blankevoort - TWIML Talk #292
51:09

Today we’re joined by Tijmen Blankevoort, a staff engineer at Qualcomm, who leads their compression and quantization research teams. Tijmen is also co-founder of ML startup Scyfer, along with Qualcomm colleague Max Welling, who we spoke with back on episode 267. In our conversation with Tijmen we discuss: 

• The ins and outs of compression and quantization of ML models, specifically NNs,

• How much models can actually be compressed, and the best way to achieve compression, 

• We also look at a few recent papers including “Lottery Hypothesis."  

Check out the full show notes at twimlai.com/talk/292.

 

Aug 19, 2019
Identifying New Materials with NLP with Anubhav Jain - TWIML Talk #291
39:54

Today we are joined by Anubhav Jain, Staff Scientist & Chemist at Lawrence Berkeley National Lab. Anubhav leads the Hacker Materials Research Group, where his research focuses on applying computing to accelerate the process of finding new materials for functional applications. With the immense amount of published scientific research out there, it can be difficult to understand how that information can be applied to future studies, let alone find a way to read it all. In this episode we discuss:

- His latest paper, ‘Unsupervised word embeddings capture latent knowledge from materials science literature’

- The design of a system that takes the literature and uses natural language processing to analyze, synthesize and then conceptualize complex material science concepts

- How the method is shown to recommend materials for functional applications in the future - scientific literature mining at its best.

Check out the complete show notes at twimlai.com/talk/291.

Aug 15, 2019
The Problem with Black Boxes with Cynthia Rudin - TWIML Talk #290
48:25

You asked, we listened! Today, by listener request, we are joined by Cynthia Rudin, Professor of Computer Science, Electrical and Computer Engineering, and Statistical Science at Duke University. Cynthia is passionate about machine learning and social justice, with extensive work and leadership in both areas. In this episode we discuss:

  • Her paper, ‘Please Stop Explaining Black Box Models for High Stakes Decisions’
  • How interpretable models make for less error-prone and more comprehensible decisions - and why we should care
  • A break down of black box and interpretable models, including their development, sample use cases, and more!

Check out the complete show notes at https://twimlai.com/talk/290

Aug 14, 2019
Human-Robot Interaction and Empathy with Kate Darling - TWIML Talk #289
43:56

Today we’re joined by Dr. Kate Darling, Research Specialist at the MIT Media Lab. Kate’s focus is on robot ethics and interaction, namely the social implication of how people treat robots and the purposeful design of robots in our daily lives. This episode is a fascinating look into the intersection of psychology and how we are using technology. We cover topics like:

  • How to measure empathy
  • The impact of robot treatment on kids behavior
  • The correlation between animals and robots 
  • Why ‘successful’ robots aren’t always humanoid and so much more!
Aug 08, 2019
Automated ML for RNA Design with Danny Stoll - TWIML Talk #288
36:29

Today we’re joined by Danny Stoll, Research Assistant at the University of Freiburg. Since high school, Danny has been fascinated by Deep Learning which has grown into a desire to make machine learning available to anyone with interest. Danny’s current research can be encapsulated in his latest paper, ‘Learning to Design RNA’. Designing RNA molecules has become increasingly popular as RNA is responsible for regulating biological process, even connected to diseases like Alzheimers and Epilepsy. In this episode, Danny discusses:

  • The RNA design process through reverse engineering
  • How his team’s deep learning algorithm is applied to train and design sequences
  • Transfer learning & multitask learning
  • Ablation studies, hyperparameter optimization, the difference between chemical and statistical based approaches and more!
Aug 05, 2019
Developing a brain atlas using deep learning with Theofanis Karayannis - TWIML Talk #287
38:37

Today we’re joined by Theofanis Karayannis, Assistant Professor at the Brain Research Institute of the University of Zurich. Theo’s research is currently focused on understanding how circuits in the brain are formed during development and modified by experiences. Working with animal models, Theo segments and classifies the brain regions, then detects cellular signals that make connections throughout and between each region. How? The answer is (relatively) simple: Deep Learning. In this episode we discuss:

  • Adapting DL methods to fit the biological scope of work
  • The distribution of connections that makes neurological decisions in both animals and humans every day
  • The way images of the brain are collected
  • Genetic trackability, and more!
Aug 01, 2019
Environmental Impact of Large-Scale NLP Model Training with Emma Strubell - TWIML Talk #286
38:36

Today we’re joined by Emma Strubell, currently a visiting scientist at Facebook AI Research. Emma’s focus is on NLP and bringing state of the art NLP systems to practitioners by developing efficient and robust machine learning models. Her paper, Energy and Policy Considerations for Deep Learning in NLP, hones in on one of the biggest topics of the generation: environmental impact. In this episode we discuss:

  • How training neural networks have resulted in an increase in accuracy, however the computational resources required to train these models is staggering - and carbon footprints are only getting bigger
  • Emma’s research methods for determining carbon emissions
  • How companies are reacting to environmental concerns
  • What we, as an industry, can be doing better
Jul 29, 2019
“Fairwashing” and the Folly of ML Solutionism with Zachary Lipton - TWIML Talk #285
01:15:39

Today we’re joined by Zachary Lipton, Assistant Professor in the Tepper School of Business. With an overarching theme of data quality and interpretation, Zachary's research and work is focused on machine learning in healthcare, with the goal of not replacing doctors, but to assist through an understanding of the diagnosis and treatment process. Zachary is also working on the broader question of fairness and ethics in machine learning systems across multiple industries. We delve into these topics today, discussing: 

  • Supervised learning in the medical field, 
  • Guaranteed robustness under distribution shifts, 
  • The concept of ‘fairwashing’,
  • How there is insufficient language in machine learning to encompass abstract ethical behavior, and much, much more
Jul 25, 2019
Retinal Image Generation for Disease Discovery with Stephen Odaibo - TWIML Talk #284
41:39

Today we’re joined by Dr. Stephen Odaibo, Founder and CEO of RETINA-AI Health Inc. Stephen’s unique journey to machine learning and AI includes degrees in math, medicine and computer science, which led him to an ophthalmology practice before taking on the ultimate challenge as an entrepreneur. In this episode we discuss:

  • How RETINA-AI Health harnesses the power of machine learning to build autonomous systems that diagnose and treat retinal diseases 
  • The importance of domain experience and how Stephen’s expertise in ophthalmology and engineering along with the current state of both industries that led to the founding of his company
  • His work with GANs to create artificial retinal images and why more data isn’t always better!
Jul 22, 2019
Real world model explainability with Rayid Ghani - TWiML Talk #283
50:58

Today we’re joined by Rayid Ghani, Director of the Center for Data Science and Public Policy at the University of Chicago. Rayid’s goal is to combine his skills in machine learning and data with his desire to improve public policy and the social sector. Drawing on his range of experience from the corporate world to Chief Scientist for the 2012 Obama Campaign, we delve into the world of automated predictions and explainability methods. Here we discuss:

  • How automated predictions can be helpful, but they don’t always paint a full picture 
  • When dealing with public policy and the social sector, the key to an effective explainability method is the correct context
  • Machine feedback loops that help humans override the wrong predictions and reinforce the right ones
  • Supporting proactive intervention through complex explanability tools
Jul 18, 2019
Inspiring New Machine Learning Platforms w/ Bioelectric Computation with Michael Levin - TWiML Talk #282
25:55

Today we’re joined by Michael Levin, Director of the Allen Discovery Institute at Tufts University. Michael joined us back at NeurIPS to discuss his invited talk “What Bodies Think About: Bioelectric Computation Beyond the Nervous System as Inspiration for New Machine Learning Platforms.” In our conversation, we talk about:

  • Synthetic living machines, novel AI architectures and brain-body plasticity
  • How our DNA doesn’t control everything like we thought and how the behavior of cells in living organisms can be modified and adapted
  • Biological systems dynamic remodeling in the future of developmental biology and regenerative medicine...and more!

The complete show notes for this episode can be found at twimlai.com/talk/282

Register for TWIMLcon: AI Platforms now at twimlcon.com!

Jul 15, 2019
Simulation and Synthetic Data for Computer Vision with Batu Arisoy - TWiML Talk #281
41:36

Today we’re joined by Batu Arisoy, Research Manager with the Vision Technologies & Solutions team at Siemens Corporate Technology. Currently, Batu’s research focus is solving limited data computer vision problems, providing R&D for many of the business units throughout the company. In our conversation we discuss:

  • An emulation of a teacher teaching students information without the use of memorization
  • Discerning which parts of our neural network are required to make decisions
  • An activity recognition project with the Office of Naval Research that keeps ‘humans in the loop’ and more.

 The complete show notes for this episode can be found at twimlai.com/talk/281

Register for TWIMLcon: AI Platforms now at twimlcon.com!

Thanks to Siemens for their sponsorship of today's episode! Check out what they’re up to, visit twimlai.com/siemens.

Jul 09, 2019
Spiking Neural Nets and ML as a Systems Challenge with Jeff Gehlhaar - TWIML Talk #280
54:08

Today we’re joined by Jeff Gehlhaar, VP of Technology and Head of AI Software Platforms at Qualcomm. As we’ve explored in our conversations with both Gary Brotman and Max Welling, Qualcomm has a hand in tons of machine learning research and hardware, and our conversation with Jeff is no different. We discuss:

• How the various training frameworks fit into the developer experience when working with their chipsets.

• Examples of federated learning in the wild.

• The role inference will play in data center devices and more.

The complete show notes for this episode can be found at twimlai.com/talk/280

Register for TWIMLcon now at twimlcon.com.

Thanks to Qualcomm for their sponsorship of today's episode! Check out what they're up to at twimlai.com/qualcomm.

Jul 08, 2019
Transforming Oil & Gas with AI with Adi Bhashyam and Daniel Jeavons - TWIML Talk #279
46:27

Today we’re joined by return guest Daniel Jeavons, GM of Data Science at Shell, and Adi Bhashyam, GM of Data Science at C3, who we had the pleasure of speaking to at this years C3 Transform Conference. In our conversation, we discuss:

• The progress that Dan and his team has made since our last conversation, including an overview of their data platform.

• We explore the various types of users of the platform, and how those users informed the decision to use C3’s out-of-the-box platform solution instead of building their own internal platform.

• Adi gives us an overview of the evolution of C3 and their platform, along with a breakdown of a few Shell-specific use cases. 

The complete show notes can be found at twimlai.com/talk/279.

Visit twimlcon.com to learn more about the TWIMLcon: AI Platforms conference! Early-bird registration has been extended until this Wednesday, 7/3, register today for the lowest possible price!!

Jul 01, 2019
Fast Radio Burst Pulse Detection with Gerry Zhang - TWIML Talk #278
38:04

Today we’re joined by Yunfan Gerry Zhang, a PhD student in the Department of Astrophysics at UC Berkely, and an affiliate of Berkeley’s SETI research center. In our conversation, we discuss: 

• Gerry's research on applying machine learning techniques to astrophysics and astronomy.

• His paper “Fast Radio Burst 121102 Pulse Detection and Periodicity: A Machine Learning Approach”.

• We explore the types of data sources used for this project, challenges Gerry encountered along the way, the role of GANs and much more.

The complete show notes can be found at twimlai.com/talk/278.

Visit twimlcon.com to learn more about the TWIMLcon: AI Platforms conference! Early-bird registration ends TOMORROW 6/28! Register now!

Jun 27, 2019
Tracking CO2 Emissions with Machine Learning with Laurence Watson - TWIML Talk #277
41:08

Today we’re joined by Laurence Watson, Co-Founder and CTO of Plentiful Energy and a former data scientist at Carbon Tracker. In our conversation, we discuss:

• Carbon Tracker's goals, and their report “Nowhere to hide: Using satellite imagery to estimate the utilisation of fossil fuel power plants”.

• How they're using computer vision to process satellite images of coal plants, including how the images are labeled

•Various challenges with the scope and scale of this project, including dealing with varied time zones and imbalanced training classes.

The complete show notes can be found at twimlai.com/talk/277.

Visit twimlcon.com to learn more about the TWIMLcon: AI Platforms conference! Early-bird registration ends on 6/28!

Jun 24, 2019
Topic Modeling for Customer Insights at USAA with William Fehlman - TWIML Talk #276
44:27

Today we’re joined by William Fehlman, director of data science at USAA. We caught up with William a while back to discuss:

  • His work on topic modeling, which USAA uses in various scenarios, including chat channels with members via mobile and desktop interfaces.
  • How their datasets are generated.
  • Explored methodologies of topic modeling, including latent semantic indexing, latent Dirichlet allocation, and non-negative matrix factorization.
  • We also explore how terms are represented via a document-term matrix, and how they are scored based on coherence.

The complete show notes can be found at twimlai.com/talk/276.

Visit twimlcon.com to learn more about the TWIMLcon: AI Platforms conference! Early-bird registration ends on 6/28!

Jun 20, 2019
Phronesis of AI in Radiology with Judy Gichoya - TWIML Talk #275
43:04

Today we’re joined by Judy Gichoya an interventional radiology fellow at the Dotter Institute at Oregon Health and Science University. In our conversation, we discuss:

• Judy's research in “Phronesis of AI in Radiology: Superhuman meets Natural Stupidy,” reviewing the claims of “superhuman” AI performance in radiology.

• We explore potential roles in which AI can have success in radiology, along with some of the different types of biases that can manifest themselves across multiple use cases.

• We look at the CheXNet paper, which details how human and AI performance can complement and improve each other's performance for detecting pneumonia in chest X-rays.

The complete show notes can be found at twimlai.com/talk/275.

Visit twimlcon.com to learn more about the TWIMLcon: AI Platforms conference! 

Jun 18, 2019
The Ethics of AI-Enabled Surveillance with Karen Levy - TWIML Talk #274
42:34

Today we’re joined by Karen Levy, assistant professor in the department of information science at Cornell University. Karen’s research focuses on how rules and technologies interact to regulate behavior, especially the legal, organizational, and social aspects of surveillance and monitoring. In our conversation we discuss:

• Examples of how data tracking and surveillance can be used in ways that can be abusive to various marginalized groups, including detailing her extensive research into truck driver surveillance.

• Her thoughts on how the broader society will react to the increase in surveillance,

• The unintended consequences of surveillant systems, questions surrounding hybridization of jobs and systems, and more!

The complete show notes can be found at twimlai.com/talk/274.

Visit twimlcon.com to learn more about the TWIMLcon: AI Platforms conference! 

Jun 14, 2019
Supporting Rapid Model Development at Two Sigma with Matt Adereth & Scott Clark - TWIML Talk #273
49:09

Today we’re joined by Matt Adereth, managing director of investments at Two Sigma, and return guest Scott Clark, co-founder and CEO of SigOpt, to discuss:

• The end to end modeling platform at Two Sigma, who it serves, and challenges faced in production and modeling.

• How Two Sigma has attacked the experimentation challenge with their platform.

• The relationship between the optimization and infrastructure teams at SigOpt.

• What motivates companies that aren’t already heavily invested in platforms, optimization or automation, to do so.

The complete show notes can be found at twimlai.com/talk/273.

Visit twimlcon.com to learn more about the TWIMLcon: AI Platforms conference! The first 10 listeners who register get their ticket for 75% off using the discount code TWIMLFIRST!

Follow along with the entire AI Platforms Vol 2 series at twimlai.com/aiplatforms2.

Thanks to SigOpt for their continued support of the podcast, and their sponsorship of this episode! Check out their machine learning experimentation and optimization suite, and get a free trial at twimlai.com/sigopt.

Jun 11, 2019
Scaling Model Training with Kubernetes at Stripe with Kelley Rivoire - TWIML Talk #272
45:07

Today we’re joined by Kelley Rivoire, engineering manager working on machine learning infrastructure at Stripe. Kelley and I caught up at a recent Strata Data conference to discuss:

• Her talk "Scaling model training: From flexible training APIs to resource management with Kubernetes."

• Stripe’s machine learning infrastructure journey, including their start from a production focus.

• Internal tools used at Stripe, including Railyard, an API built to manage model training at scale & more!

The complete show notes can be found at twimlai.com/talk/272.

Visit twimlcon.com to learn more about the TWIMLcon: AI Platforms conference! The first 10 listeners who register get their ticket for 75% off using the discount code TWIMLFIRST!

Follow along with the entire AI Platforms Vol 2 series at twimlai.com/aiplatforms2.

Thanks to SigOpt for their continued support of the podcast, and their sponsorship of this episode! Check out their machine learning experimentation and optimization suite, and get a free trial at twimlai.com/sigopt.

Jun 06, 2019
Productizing ML at Scale at Twitter with Yi Zhuang - TWIML Talk #271
49:20

Today we continue our AI Platforms series joined by Yi Zhuang, Senior Staff Engineer at Twitter & Tech Lead for Machine Learning Core Environment at Twitter Cortex. In our conversation, we cover: 

• The machine learning landscape at Twitter, including with the history of the Cortex team

• Deepbird v2, which is used for model training and evaluation solutions, and it's integration with Tensorflow 2.0.

• The newly assembled “Meta” team, that is tasked with exploring the bias, fairness, and accountability of their machine learning models, and much more!

The complete show notes can be found at twimlai.com/talk/271.

Visit twimlcon.com to learn more about the TWIMLcon: AI Platforms conference! The first 10 listeners who register get their ticket for 75% off using the discount code TWIMLFIRST!

Follow along with the entire AI Platforms Vol 2 series at twimlai.com/aiplatforms2.

Thanks to SigOpt for their continued support of the podcast, and their sponsorship of this episode! Check out their machine learning experimentation and optimization suite, and get a free trial at twimlai.com/sigopt.

Finally, visit twimlai.com/3bday to help us celebrate TWiML's 3rd Birthday!

Jun 03, 2019
Snorkel: A System for Fast Training Data Creation with Alex Ratner - TWiML Talk #270
45:42

Today we’re joined by Alex Ratner, Ph.D. student at Stanford. In our conversation, we discuss:

• Snorkel, the open source framework that is the successor to Stanford's Deep Dive project.

• How Snorkel is used as a framework for creating training data with weak supervised learning techniques.

• Multiple use cases for Snorkel, including how it is used by large companies like Google. 

The complete show notes can be found at twimlai.com/talk/270.

Follow along with the entire AI Platforms Vol 2 series at twimlai.com/aiplatforms2.

Thanks to SigOpt for their continued support of the podcast, and their sponsorship of this episode! Check out their machine learning experimentation and optimization suite, and get a free trial at twimlai.com/sigopt.

Finally, visit twimlai.com/3bday to help us celebrate TWiML's 3rd Birthday!

May 30, 2019
Advancing Autonomous Vehicle Development Using Distributed Deep Learning with Adrien Gaidon - TWiML Talk #269
50:04

In this, the kickoff episode of AI Platforms Vol. 2, we're joined by Adrien Gaidon, Machine Learning Lead at Toyota Research Institute. Adrien and I caught up to discuss his team’s work on deploying distributed deep learning in the cloud, at scale. In our conversation, we discuss: 

• The beginning and gradual scaling up of TRI's platform.

• Their distributed deep learning methods, including their use of stock Pytorch.

• Applying devops to their research infrastructure, and much more!

The complete show notes for this episode can be found at twimlai.com/talk/269.

Thanks to SigOpt for their continued support of the podcast, and their sponsorship of this episode! Check out their machine learning experimentation and optimization suite, and get a free trial at twimlai.com/sigopt.

Finally, visit twimlai.com/3bday to help us celebrate TWiML's 3rd Birthday!

May 28, 2019
Are We Being Honest About How Difficult AI Really Is? w/ David Ferrucci - TWiML Talk #268
52:36

Today we’re joined by David Ferrucci, Founder, CEO, and Chief Scientist at Elemental Cognition, a company focused on building natural learning systems that understand the world the way people do. In our conversation, we discuss: 

• His experience leading the team that built the IBM Watson system that won on Jeopardy.


• The role of “understanding” in the context of AI systems, and the types of commitments and investments needed to achieve even modest levels of understanding in these systems.

• His thoughts on the power of deep learning, what the path to AGI looks like, and the need for hybrid systems to get there.

The complete show notes for this episode can be found at twimlai.com/talk/268.

Visit twimlai.com/3bday to help us celebrate TWiML's 3rd Birthday!

 

May 23, 2019
Gauge Equivariant CNNs, Generative Models, and the Future of AI with Max Welling - TWiML Talk #267
01:04:26

Today we’re joined by Max Welling, research chair in machine learning at the University of Amsterdam, as well as VP of technologies at Qualcomm, and Fellow at the Canadian Institute for Advanced Research, or CIFAR. In our conversation, we discuss: 

• Max’s research at Qualcomm AI Research and the University of Amsterdam, including his work on Bayesian deep learning, Graph CNNs and Gauge Equivariant CNNs, and in power efficiency for AI via compression, quantization, and compilation.

• Max’s thoughts on the future of the AI industry, in particular, the relative importance of models, data and compute.

The complete show notes for this episode can be found at twimlai.com/talk/267.

Thanks to Qualcomm for sponsoring today's episode! Check out what they're up to at twimlai.com/qualcomm.

 

May 20, 2019
Can We Trust Scientific Discoveries Made Using Machine Learning? with Genevera Allen - TWiML Talk #266
41:54

Today we’re joined by Genevera Allen, associate professor of statistics in the EECS Department at Rice University, Founder and Director of the Rice Center for Transforming Data to Knowledge and Investigator with the Neurological Research Institute with the Baylor College of Medicine.

Genevera caused quite the stir at the American Association for the Advancement of Science meeting earlier this year with her presentation “Can We Trust Data-Driven Discoveries?" In our conversation we cover:

• The goal of Genevera's talk, and what was lost in translation.

• Use cases outlining the shortcomings of current machine learning techniques.

• Reproducibility, including inference vs discovery, and the lack of terminology for many of the various reproducibility issues, & much more!

The complete show notes for this episode can be found at twimlai.com/talk/266.

 

May 16, 2019
Creative Adversarial Networks for Art Generation with Ahmed Elgammal - TWiML Talk #265
37:13

Today we’re joined by Ahmed Elgammal, a professor in the department of computer science at Rutgers, and director of The Art and Artificial Intelligence Lab. In my conversation with Ahmed, we discuss:

• His work on AICAN, a creative adversarial network that produces original portraits, trained with over 500 years of European canonical art.

• How complex the computational representations of the art actually are, and how he simplifies them.

• Specifics of the training process, including the various types of artwork used, and the constraints applied to the model.

The complete show notes for this episode can be found at twimlai.com/talk/265.

May 13, 2019
Diagnostic Visualization for Machine Learning with YellowBrick w/ Rebecca Bilbro - TWiML Talk #264
42:34

Today we close out our PyDataSci series joined by Rebecca Bilbro, head of data science at ICX media and co-creator of the popular open-source visualization library YellowBrick.

In our conversation, Rebecca details:

• Her relationship with toolmaking, which led to the eventual creation of Yellowbrick.

• Popular tools within YellowBrick, including a summary of their unit testing approach.

• Interesting use cases that she’s seen over time.

• The growth she’s seen in the community of contributors and examples of their contributions as they approach the release of YellowBrick 1.0.

The complete show notes for this episode can be found at twimlai.com/talk/264. Check out the rest of the PyDataSci series at twimlai.com/pydatasci.

We want to better understand your views on the importance of open source and the projects and players in this space. To access the survey visit twimlai.com/pythonsurvey.

Thanks to this weeks sponsor, IBM, for their support of the podcast! Visit twimlai.com/ibm to learn more about the IBM Data Science Community.

May 10, 2019
Librosa: Audio and Music Processing in Python with Brian McFee - TWiML Talk #263
39:10

Today we continue our PyDataSci series joined by Brian McFee, assistant professor of music technology and data science at NYU, and creator of LibROSA, a python package for music and audio analysis.

Brian walks us through his experience building LibROSA, including:

• Detailing the core functions provided in the library,

• His experience working within Jupyter Notebook,

• We explore a typical LibROSA workflow & more!

The complete show notes for this episode can be found at twimlai.com/talk/263.

Check out the rest of the PyDataSci series at twimlai.com/pydatasci.

We want to better understand your views on the importance of open source and the projects and players in this space. To access the survey visit twimlai.com/pythonsurvey.

Thanks to this weeks sponsor, IBM, for their support of the podcast! Visit twimlai.com/ibm to learn more about the IBM Data Science Community.

May 09, 2019
Practical Natural Language Processing with spaCy and Prodigy w/ Ines Montani - TWiML Talk #262
49:39

In this episode of PyDataSci, we’re joined by Ines Montani, Cofounder of Explosion, Co-developer of SpaCy and lead developer of Prodigy.

Ines and I caught up to discuss her various projects, including the aforementioned SpaCy, an open-source NLP library built with a focus on industry and production use cases.

The complete show notes for this episode can be found at twimlai.com/talk/262. Check out the rest of the PyDataSci series at twimlai.com/pydatasci.

We want to better understand your views on the importance of open source and the projects and players in this space. To access the survey visit twimlai.com/pythonsurvey.

Thanks to this weeks sponsor, IBM, for their support of the podcast! Visit twimlai.com/ibm to learn more about the IBM Data Science Community. 

May 07, 2019
Scaling Jupyter Notebooks with Luciano Resende - TWiML Talk #261
34:28

Today we kick off PyDataSci with Luciano Resende, an Open Source AI Platform Architect at IBM and part of the Center for Open Source Data and AI Technology.

Luciano and I caught up to discuss his work on Jupyter Enterprise Gateway, a scalable way to share Jupyter notebooks and other resources in an enterprise environment. In our conversation, we discuss some of the challenges that arise using Jupyter Notebooks at scale, the role of open source projects like Jupyter Hub and Enterprise Gateway, and some potential reasons for investing in and building custom notebooks. We also explore some common requests from the community, such as tighter integration with git repositories, as well as the python-centricity of the vast Jupyter ecosystem.

The complete show notes for this episode can be found at twimlai.com/talk/261. Check out the rest of the PyDataSci series at twimlai.com/pydatasci.

Thanks to this weeks sponsor, IBM, for their support of the podcast! Visit twimlai.com/ibm to learn more about the IBM Data Science Community. 

 

May 06, 2019
Fighting Fake News and Deep Fakes with Machine Learning w/ Delip Rao - TWiML Talk #260
58:40

Today we’re joined by Delip Rao, vice president of research at the AI Foundation, co-author of the book Natural Language Processing with PyTorch, and creator of the Fake News Challenge.

Our conversation begins with the origin story of the Fake News Challenge, including Delip’s initial motivations for the project, and what some of his key takeaways were from that experience. We then dive into a discussion about the generation and detection of artificial content, including “fake news” and “deep fakes.” We discuss the state of generation and detection for text, video, and audio, the key challenges in each of these modalities, the role of GANs on both sides of the equation, and other potential solutions. Finally, we discuss Delip’s new book, Natural Language Processing with PyTorch and his philosophy behind writing it.

The complete show notes for this episode can be found at https://twimlai.com/talk/260.

For more from the AI Conference NY series, visit twimlai.com/nyai19.

Thanks to our friends at HPE for sponsoring this week's series of shows from the O’Reilly AI Conference in New York City! For more information on HPE InfoSight, visit twimlai.com/hpe.

 

May 03, 2019
Maintaining Human Control of Artificial Intelligence with Joanna Bryson - TWiML Talk #259
38:11

Today we’re joined by Joanna Bryson, Reader at the University of Bath.

I was fortunate to catch up with Joanna at the AI Conference where she presented on “Maintaining Human Control of Artificial Intelligence,“ focusing on technological and policy mechanisms that could be used to achieve that goal. In our conversation, we explore our current understanding of “natural intelligence” and how it can inform the development of AI, the context in which she uses the term “human control” and its implications, and the meaning of and need to apply “DevOps” principles when developing AI systems. This was a fun one!

The complete show notes for this episode can be found at https://twimlai.com/talk/259.

For more from the AI Conference NY series, visit twimlai.com/nyai19.

Thanks to our friends at HPE for sponsoring this week's series of shows from the O’Reilly AI Conference in New York City! For more information on HPE InfoSight, visit twimlai.com/hpe.

May 01, 2019
Intelligent Infrastructure Management with Pankaj Goyal & Rochna Dhand - TWiML Talk #258
44:49

Today we kick off our AI conference NY series with Pankaj Goyal, VP for AI & HPC product management at HPE, and Rochna Dhand, director of product management for HPE InfoSight.


Today we get things kicked off with Pankaj Goyal, VP for AI & HPC product management at HPE, and Rochna Dhand, director of product management for HPE InfoSight. In our conversation, Pankaj shares some examples of the kind of AI projects HPE is working with customers on And Rochna details hows HPE’s Infosight helps IT organizations better manage and ensure the health of an enterprise’s IT infrastructure using machine learning. We discuss the key use cases addressed by InfoSight, the types of models it uses for its analysis and some of the results seen in real-world deployments.

The complete show notes for this episode can be found at https://twimlai.com/talk/258.

For more from the AI Conference NY series, visit twimlai.com/nyai19.

Thanks to our friends at HPE for sponsoring this week's series of shows from the O’Reilly AI Conference in New York City! For more information on HPE InfoSight, visit twimlai.com/hpe.

Apr 29, 2019
Organizing for Successful Data Science at Stitch Fix with Eric Colson - TWiML Talk #257
52:38

For the final episode of our Strata Data series, we’re joined by Eric Colson, Chief Algorithms Officer at Stitch Fix, whose presentation at the conference explored “How to make fewer bad decisions.”

Our discussion focuses in on the three key organizational principles for data science teams that he’s developed at Stitch Fix. Along the way, we also talk through the various roles data science plays at the company, explore a few of the 800+ algorithms in use at the company spanning recommendations, inventory management, demand forecasting, and clothing design. We discuss the roles of Stitch Fix’splatforms team in supporting the data science organization, and his unique perspective on how to identify platform features.

The complete show notes for this episode can be found at https://twimlai.com/talk/257.

For more from the Strata Data conference series, visit twimlai.com/stratasf19.

I want to send a quick thanks to our friends at Cloudera for their sponsorship of this series of podcasts from the Strata Data Conference, which they present along with O’Reilly Media. Cloudera’s long been a supporter of the podcast; in fact, they sponsored the very first episode of TWiML Talk, recorded back in 2016. Since that time Cloudera has continued to invest in and build out its platform, which already securely hosts huge volumes of enterprise data, to provide enterprise customers with a modern environment for machine learning and analytics that works both in the cloud as well as the data center. In addition, Cloudera Fast Forward Labs provides research and expert guidance that helps enterprises understand the realities of building with AI technologies without needing to hire an in-house research team. To learn more about what the company is up to and how they can help, visit Cloudera’s Machine Learning resource center at cloudera.com/ml.

Apr 26, 2019
End-to-End Data Science to Drive Business Decisions at LinkedIn with Burcu Baran - TWiML Talk #256
49:51

In this episode of our Strata Data conference series, we’re joined by Burcu Baran, Senior Data Scientist at LinkedIn.

At Strata, Burcu, along with a few members of her team, delivered the presentation “Using the full spectrum of data science to drive business decisions,” which outlines how LinkedIn manages their entire machine learning production process. In our conversation, Burcu details each phase of the process, including problem formulation, monitoring features, A/B testing and more. We also discuss how her “horizontal” team works with other more “vertical” teams within LinkedIn, various challenges that arise when training and modeling such as data leakage and interpretability, best practices when trying to deal with data partitioning at scale, and of course, the need for a platform that reduces the manual pieces of this process, promoting efficiency.

The complete show notes for this episode can be found at https://twimlai.com/talk/256.

For more from the Strata Data conference series, visit twimlai.com/stratasf19.

I want to send a quick thanks to our friends at Cloudera for their sponsorship of this series of podcasts from the Strata Data Conference, which they present along with O’Reilly Media. Cloudera’s long been a supporter of the podcast; in fact, they sponsored the very first episode of TWiML Talk, recorded back in 2016. Since that time Cloudera has continued to invest in and build out its platform, which already securely hosts huge volumes of enterprise data, to provide enterprise customers with a modern environment for machine learning and analytics that works both in the cloud as well as the data center. In addition, Cloudera Fast Forward Labs provides research and expert guidance that helps enterprises understand the realities of building with AI technologies without needing to hire an in-house research team. To learn more about what the company is up to and how they can help, visit Cloudera’s Machine Learning resource center at cloudera.com/ml.

I’d also like to send a huge thanks to LinkedIn for their continued support and sponsorship of the show! Now that I’ve had a chance to interview several of the folks on LinkedIn’s Data Science and Engineering teams, it’s really put into context the complexity and scale of the problems that they get to work on in their efforts to create enhanced economic opportunities for every member of the global workforce. AI and ML are integral aspects of almost every product LinkedIn builds for its members and customers and their massive, highly structured dataset gives their data scientists and researchers the ability to conduct applied research to improve member experiences. To learn more about the work of LinkedIn Engineering, please visit engineering.linkedin.com/blog.

Apr 24, 2019
Learning with Limited Labeled Data with Shioulin Sam - TWiML Talk #255
44:36

Today, in the first episode of our Strata Data conference series, we’re joined by Shioulin Sam, Research Engineer with Cloudera Fast Forward Labs.

Shioulin and I caught up to discuss the newest report to come out of CFFL, “Learning with Limited Label Data,” which explores active learning as a means to build applications requiring only a relatively small set of labeled data. We start our conversation with a review of active learning and some of the reasons why it’s recently become an interesting technology for folks building systems based on deep learning. We then discuss some of the differences between active learning approaches or implementations, and some of the common requirements of an active learning system. Finally, we touch on some packaged offerings in the marketplace that include active learning, including Amazon’s SageMaker Ground Truth, and review Shoulin’s tips for getting started with the technology.

The complete show notes for this episode can be found at https://twimlai.com/talk/255.

For more from the Strata Data conference series, visit twimlai.com/stratasf19.

I want to send a quick thanks to our friends at Cloudera for their sponsorship of this series of podcasts from the Strata Data Conference, which they present along with O’Reilly Media. Cloudera’s long been a supporter of the podcast; in fact, they sponsored the very first episode of TWiML Talk, recorded back in 2016. Since that time Cloudera has continued to invest in and build out its platform, which already securely hosts huge volumes of enterprise data, to provide enterprise customers with a modern environment for machine learning and analytics that works both in the cloud as well as the data center. In addition, Cloudera Fast Forward Labs provides research and expert guidance that helps enterprises understand the realities of building with AI technologies without needing to hire an in-house research team. To learn more about what the company is up to and how they can help, visit Cloudera’s Machine Learning resource center at cloudera.com/ml.

Apr 22, 2019
cuDF, cuML & RAPIDS: GPU Accelerated Data Science with Paul Mahler - TWiML Talk #254
38:13

Today we're joined by Paul Mahler, senior data scientist and technical product manager for machine learning at NVIDIA.

In our conversation, Paul and I discuss NVIDIA's RAPIDS open source project, which aims to bring GPU acceleration to traditional data science workflows and machine learning tasks. We dig into the various subprojects like cuDF and cuML that make up the RAPIDS ecosystem, as well as the role of lower-level libraries like mlprims and the relationship to other open-source projects like Scikit-learn, XGBoost and Dask.

The complete show notes for this episode can be found at https://twimlai.com/talk/254.

Visit twimlai.com/gtc19 for more from our GTC 2019 series.

To learn more about Dell Precision workstations, and some of the ways they’re being used by customers in industries like Media and Entertainment, Engineering and Manufacturing, Healthcare and Life Sciences, Oil and Gas, and Financial services, visit Dellemc.com/Precision.

Apr 19, 2019
Edge AI for Smart Manufacturing with Trista Chen - TWiML Talk #253
38:39

Today we’re joined by Trista Chen, chief scientist of machine learning at Inventec.

At GTC, Trista spoke on “Edge AI in Smart Manufacturing: Defect Detection and Beyond.” In our conversation, we discuss a few of the challenges that Industry 4.0 initiatives aim to address and dig into a few of the various use cases she’s worked on, such as the deployment of machine learning in an industrial setting to perform defect detection, safety improvement, demand forecasting, and more. We also dig into the role of edge, cloud, and what she calls hybrid AI, which is inference happening both in the cloud and on the edge concurrently. Finally, we discuss the challenges associated with estimating the ROI of industrial AI projects and the need that often arises to redefine the problem to understand the ultimate impact of the solution.

The complete show notes for this episode can be found at https://twimlai.com/talk/253.

Visit twimlai.com/gtc19 for more from our GTC 2019 series.

To learn more about Dell Precision workstations, and some of the ways they’re being used by customers in industries like Media and Entertainment, Engineering and Manufacturing, Healthcare and Life Sciences, Oil and Gas, and Financial services, visit Dellemc.com/Precision.

Apr 18, 2019
Machine Learning for Security and Security for Machine Learning with Nicole Nichols - TWiML Talk #252
41:56

Today we’re joined by Nicole Nichols, a senior research scientist at the Pacific Northwest National Lab.

Nicole joined me to discuss her recent presentation at GTC, which was titled “Machine Learning for Security, and Security for Machine Learning.” Our conversation explores the two use cases she presented, insider threat detection, and software fuzz testing. We discuss the effectiveness of standard and bidirectional RNN language models for detecting malicious activity within the Los Alamos National Laboratory cybersecurity dataset, the augmentation of software fuzzing techniques using deep learning, and light-based adversarial attacks on image classification systems. I’d love to hear your thoughts on these use cases!

The complete show notes for this episode can be found at https://twimlai.com/talk/252.

Visit twimlai.com/gtc19 for more from our GTC 2019 series.

To learn more about Dell Precision workstations, and some of the ways they’re being used by customers in industries like Media and Entertainment, Engineering and Manufacturing, Healthcare and Life Sciences, Oil and Gas, and Financial services, visit Dellemc.com/Precision.

Apr 16, 2019
Domain Adaptation and Generative Models for Single Cell Genomics with Gerald Quon - TWiML Talk #251
32:24

Today we’re joined by Gerald Quon, assistant professor in the Molecular and Cellular Biology department at UC Davis.

Gerald presented his work on Deep Domain Adaptation and Generative Models for Single Cell Genomics at GTC this year, which explores single cell genomics as a means of disease identification for treatment. In our conversation, we discuss how Gerald and his team use deep learning to generate novel insights across diseases, the different types of data that was used, and the development of ‘nested’ Generative Models for single cell measurement.

The complete show notes for this episode can be found at https://twimlai.com/talk/251.

Visit twimlai.com/gtc19 for more from our GTC 2019 series.

To learn more about Dell Precision workstations, and some of the ways they’re being used by customers in industries like Media and Entertainment, Engineering and Manufacturing, Healthcare and Life Sciences, Oil and Gas, and Financial services, visit Dellemc.com/Precision.

Apr 15, 2019
Mapping Dark Matter with Bayesian Neural Networks w/ Yashar Hezaveh - TWiML Talk #250
36:04

You might have seen the news yesterday that MIT researcher Katie Bouman produced the first image of a black hole. What’s been less reported is that the algorithm she developed to accomplish this is based on machine learning. Machine learning is having a huge impact in the fields of astronomy and astrophysics, and I’m excited to bring you interviews with some of the people innovating in this area.

Today we’re joined by Yashar Hezaveh, Assistant Professor at the University of Montreal, and Research Fellow at the Center for Computational Astrophysics at Flatiron Institute.

Yashar and I caught up to discuss his work on gravitational lensing, which is the bending of light from distant sources due to the effects of gravity. In our conversation, Yashar and I discuss how machine learning can be applied to undistort images, including some of the various techniques used and how the data is prepared to get the best results. We also discuss the intertwined roles of simulation and machine learning in generating images, incorporating other techniques such as domain transfer or GANs, and how he assesses the results of this project.

For even more on this topic, I’d also suggest checking out the following interviews, TWiML Talk #117 with Chris Shallue, where we discuss the discovery of exoplanets, TWiML Talk #184, with Viviana Acquaviva, where we explore dark energy and star formation, and if you want to go way back, TWiML Talk #5 with Joshua Bloom which provides a great overview of the application of ML in astronomy.

Thanks to Pegasystems for sponsoring today's show! I'd like to invite you to join me at PegaWorld, the company’s annual digital transformation conference, which takes place this June in Las Vegas. To learn more about the conference or to register, visit pegaworld.com and use TWIML19 in the promo code field when you get there for $200 off.

The complete show notes for this episode can be found at https://twimlai.com/talk/250.

Apr 11, 2019
Deep Learning for Population Genetic Inference with Dan Schrider - TWiML Talk #249
49:53

Today we’re joined by Dan Schrider, assistant professor in the department of genetics at The University of North Carolina at Chapel Hill.

My discussion with Dan starts with an overview of population genomics and from there digs into his application of machine learning in the field, allowing us to, for example, better understand population size changes and gene flow from DNA sequences. We then dig into Dan’s paper “The Unreasonable Effectiveness of Convolutional Neural Networks in Population Genetic Inference,” which was published in the Molecular Biology and Evolution journal, which examines the idea that CNNs are capable of outperforming expert-derived statistical methods for some key problems in the field.

Thanks to Pegasystems for sponsoring today's show! I'd like to invite you to join me at PegaWorld, the company’s annual digital transformation conference, which takes place this June in Las Vegas. To learn more about the conference or to register, visit pegaworld.com and use TWIML19 in the promo code field when you get there for $200 off.

The complete show notes for this episode can be found at https://twimlai.com/talk/249.

Apr 09, 2019
Empathy in AI with Rob Walker - TWiML Talk #248
41:26

Today we’re joined by Rob Walker, Vice President of Decision Management at Pegasystems.

Rob joined us back in episode 127 to discuss “Hyperpersonalizing the customer experience.” Today, he’s back for a discussion about the role of empathy in AI systems. In our conversation, we dig into the role empathy plays in consumer-facing human-AI interactions, the differences between empathy and ethics, and a few examples of ways empathy should be considered when building enterprise AI systems.

What do you think? Should empathy be a consideration in AI systems? If so, do any examples jump out for you of where and how it should be applied? I’d love to hear your thoughts on the topic! Feel free to shoot me a tweet at @samcharrington or leave a comment on the show notes page with your thoughts.

Thanks to Pegasystems for sponsoring today's show! I'd like to invite you to join me at PegaWorld, the company’s annual digital transformation conference, which takes place this June in Las Vegas. To learn more about the conference or to register, visit pegaworld.com and use TWIML19 in the promo code field when you get there for $200 off.

The complete show notes for this episode can be found at https://twimlai.com/talk/248.

Apr 05, 2019
Benchmarking Custom Computer Vision Services at Urban Outfitters with Tom Szumowski - TWiML Talk #247
50:38

Today we’re joined by Tom Szumowski, Data Scientist at URBN, the parent company of Urban Outfitters, Anthropologie, and other consumer fashion brands.

Tom and I caught up recently to discuss his project “Exploring Custom Vision Services for Automated Fashion Product Attribution.” We start our discussion with a definition of the product attribution problem in retail and fashion, and a discussion of the challenges it offers to data scientists. We then look at the process Tom and his team took to building custom attribution models, and the results of their evaluation of various custom vision APIs for this purpose, with a focus on the various roadblocks and lessons he and his team encountered along the way.

Thanks to Pegasystems for sponsoring today's show! I'd like to invite you to join me at PegaWorld, the company’s annual digital transformation conference, which takes place this June in Las Vegas. To learn more about the conference or to register, visit pegaworld.com and use TWIML19 in the promo code field when you get there for $200 off.

The complete show notes for this episode can be found at https://twimlai.com/talk/247.

Apr 03, 2019
Pragmatic Quantum Machine Learning with Peter Wittek - TWiML Talk #245
01:06:59

Today we’re joined by Peter Wittek, Assistant Professor at the University of Toronto working on quantum-enhanced machine learning and the application of high-performance learning algorithms in quantum physics.

Peter and I caught up back in November to discuss a presentation he gave at re:Invent, “Pragmatic Quantum Machine Learning Today.” In our conversation, we start with a bit of background including the current state of quantum computing, a look ahead to what the next 20 years of quantum computing might hold, and how current quantum computers are flawed. We then dive into our discussion on quantum machine learning, and Peter’s new course on the topic, which debuted in February. I’ll link to that in the show notes. Finally, we briefly discuss the work of Ewin Tang, a PhD student from the University of Washington, who’s undergrad thesis “A quantum-inspired classical algorithm for recommendation systems,” made quite a stir last summer. As a special treat for those interested, I’m also sharing my interview with Ewin as a bonus episode alongside this one. I’d love to hear your thoughts on how you think quantum computing will impact machine learning in the next 20 years! Send me a tweet or leave a comment on the show notes page.

Thanks to Pegasystems for sponsoring today's show! I'd like to invite you to join me at PegaWorld, the company’s annual digital transformation conference, which takes place this June in Las Vegas. To learn more about the conference or to register, visit pegaworld.com and use TWIML19 in the promo code field when you get there for $200 off.

The complete show notes for this episode can be found at https://twimlai.com/talk/245.

Apr 01, 2019
*Bonus Episode* A Quantum Machine Learning Algorithm Takedown with Ewin Tang - TWiML Talk #246
40:03

In this special bonus episode of the podcast, I’m joined by Ewin Tang, a PhD student in the Theoretical Computer Science group at the University of Washington.

In our conversation, Ewin and I dig into her paper “A quantum-inspired classical algorithm for recommendation systems,” which took the quantum computing community by storm last summer. We haven’t called out a Nerd-Alert interview in a long time, but this interview inspired us to dust off that designation, so get your notepad ready!

The complete show notes for this episode can be found at https://twimlai.com/talk/246.

Apr 01, 2019
Supporting TensorFlow at Airbnb with Alfredo Luque - TWiML Talk #244
40:57

This interview features my conversation with Alfredo Luque, a software engineer on the machine infrastructure team at Airbnb.

If you’re among the many TWiML fans interested in AI Platforms and ML infrastructure, you probably remember my interview with Airbnb’s Atul Kale, in which we discussed their Bighead platform. In my conversation with Alfredo, we dig a bit deeper into Bighead’s support for TensorFlow, discuss a recent image categorization challenge they solved with the framework, and explore what the new 2.0 release means for their users. The complete show notes for this episode can be found at https://twimlai.com/talk/244

I’d like to send a huge thanks to the TensorFlow team for helping us bring you this podcast series and giveaway. With all the great announcements coming out of the TensorFlow Dev Summit, including the 2.0 alpha, you should definitely check out the latest and greatest at https://tensorflow.org where you can also download and start building with the framework.

In conjunction with the TensorFlow 2.0 alpha release, and our TensorFlow Dev Summit series, we invite you to enter our TensorFlow Edge Kit Giveaway. Winners will receive a gift box from Google that includes some fun toys including the new Coral Edge TPU device and the SparkFun Edge development board powered by TensorFlow. Find out more at https://twimlai.com/tfgiveaway.

Mar 28, 2019
Mining the Vatican Secret Archives with TensorFlow w/ Elena Nieddu - TWiML Talk #243
44:06

Today we’re joined by Elena Nieddu, PhD Student at Roma Tre University, who presented on her project “In Codice Ratio” at the TF Dev Summit.

In our conversation, Elena provides an overview of the project, which aims to annotate and transcribe Vatican secret archive documents via machine learning. We discuss the many challenges associated with transcribing this vast archive of handwritten documents, including overcoming the high cost of data annotation. I think you’ll agree that her team’s approach to that challenge was particularly creative. The complete show notes for this episode can be found at https://twimlai.com/talk/243

I’d like to send a huge thanks to the TensorFlow team for helping us bring you this podcast series and giveaway. With all the great announcements coming out of the TensorFlow Dev Summit, including the 2.0 alpha, you should definitely check out the latest and greatest at https://tensorflow.org where you can also download and start building with the framework.

In conjunction with the TensorFlow 2.0 alpha release, and our TensorFlow Dev Summit series, we invite you to enter our TensorFlow Edge Kit Giveaway. Winners will receive a gift box from Google that includes some fun toys including the new Coral Edge TPU device and the SparkFun Edge development board powered by TensorFlow. Find out more at https://twimlai.com/tfgiveaway.

Mar 27, 2019
Exploring TensorFlow 2.0 with Paige Bailey - TWiML Talk #242
41:17

Today we're joined by Paige Bailey, a TensorFlow developer advocate at Google to discuss the TensorFlow 2.0 alpha release.

Paige and I sat down to talk through the latest TensorFlow updates, and we cover a lot of ground, including the evolution of the TensorFlow APIs and the role of eager mode, tf.keras and tf.function, the evolution of TensorFlow for Swift and its inclusion in the new fast.ai course, new updates to TFX (or TensorFlow Extended), Google’s end-to-end machine learning platform, the emphasis on community collaboration with TF 2.0, and a bunch more. The complete show notes for this episode can be found at https://twimlai.com/talk/242

I’d like to send a huge thanks to the TensorFlow team for helping us bring you this podcast series and giveaway. With all the great announcements coming out of the TensorFlow Dev Summit, including the 2.0 alpha, you should definitely check out the latest and greatest at https://tensorflow.org where you can also download and start building with the framework.

In conjunction with the TensorFlow 2.0 alpha release, and our TensorFlow Dev Summit series, we invite you to enter our TensorFlow Edge Kit Giveaway. Winners will receive a gift box from Google that includes some fun toys including the new Coral Edge TPU device and the SparkFun Edge development board powered by TensorFlow. Find out more at https://twimlai.com/tfgiveaway.

 

 

Mar 25, 2019
Privacy-Preserving Decentralized Data Science with Andrew Trask - TWiML Talk #241
32:40

Today we’re joined by Andrew Trask, PhD student at the University of Oxford and Leader of the OpenMined Project.

OpenMined is an open-source community focused on researching, developing, and promoting tools for secure, privacy-preserving, value-aligned artificial intelligence. Andrew and I caught up back at NeurIPS to dig into why OpenMined is important and explore some of the basic research and technologies supporting Private, Decentralized Data Science. We touch on ideas such as Differential Privacy, and Secure Multi-Party Computation, and how these ideas come into play in, for example, federated learning.

Thanks to Pegasystems for sponsoring today's show! I'd like to invite you to join me at PegaWorld, the company’s annual digital transformation conference, which takes place this June in Las Vegas. To learn more about the conference or to register, visit pegaworld.com and use TWIML19 in the promo code field when you get there for $200 off.

The complete show notes for this episode can be found at https://twimlai.com/talk/241.

Mar 21, 2019
The Unreasonable Effectiveness of the Forget Gate with Jos Van Der Westhuizen - TWiML Talk #240
33:32

Today we’re joined by Jos Van Der Westhuizen, PhD student in Engineering at Cambridge University.

Jos’ research focuses on applying LSTMs, or Long Short-Term Memory neural networks, to biological data for various tasks. In our conversation, we discuss his paper The unreasonable effectiveness of the forget gate, in which he explores the various “gates” that make up an LSTM module and the general impact of getting rid of gates on the computational intensity of training the networks. Jos eventually determines that leaving only the forget-gate results in an unreasonably effective network, and we discuss why. Jos also gives us some great LSTM related resources, including references to Jurgen Schmidhuber, whose research group invented the LSTM, and who I spoke to back in Talk #44.

Thanks to Pegasystems for sponsoring today's show! I'd like to invite you to join me at PegaWorld, the company’s annual digital transformation conference, which takes place this June in Las Vegas. To learn more about the conference or to register, visit pegaworld.com and use TWIML19 in the promo code field when you get there for $200 off.

The complete show notes for this episode can be found at https://twimlai.com/talk/240.

Mar 18, 2019
Building a Recommendation Agent for The North Face with Andrew Guldman - TWiML Talk #239
48:42

Today we’re joined by Andrew Guldman, VP of Product Engineering and Research and Development at Fluid.

Andrew and I caught up a while back to discuss Fluid XPS, a user experience built to help the casual shopper decide on the best product choices during online retail interactions. While XPS has expanded since we recorded this interview, we specifically discuss its origins as a product to assist outerwear retailer The North Face. In our conversation, we discuss their use of heat-sink algorithms and graph databases, and their use of chat and other interfaces, and the challenges associated with staying on top of a constantly changing technology landscape. This was a fun interview!

Thanks to Pegasystems for sponsoring today's show! I'd like to invite you to join me at PegaWorld, the company’s annual digital transformation conference, which takes place this June in Las Vegas. To learn more about the conference or to register, visit pegaworld.com and use TWIML19 in the promo code field when you get there.

The complete show notes for this episode can be found at https://twimlai.com/talk/239.

Mar 14, 2019
Active Learning for Materials Design with Kevin Tran - TWiML Talk #238
34:55

Today we’re joined by Kevin Tran, PhD student in the department of chemical engineering at Carnegie Mellon University.

Kevin’s research focuses on creating and using automated, active learning workflows to perform density functional theory, or DFT, simulations, which are used to screen for new catalysts for a myriad of materials applications. In our conversation, we explore the challenges surrounding one such application—the creation of renewable energy fuel cells, which is discussed in his recent Nature paper “Active learning across intermetallics to guide discovery of electrocatalysts for CO2 reduction and H2 evolution.” We dig into the role and need for good catalysts in this application, the role that quantum mechanics plays in finding them, and how Kevin uses machine learning and optimization to predict electrocatalyst performance.

The complete show notes for this episode can be found at twimlai.com/talk/238.

The Artificial Intelligence Conference is returning to New York in April and we have one FREE conference pass for a lucky listener! Visit twimlai.com/ainygiveaway to enter!

Mar 11, 2019
Deep Learning in Optics with Aydogan Ozcan - TWiML Talk #237
42:07

Today, we’re joined by Aydogan Ozcan, Professor of Electrical and Computer Engineering at UCLA, where his research group focuses on photonics and its applications to nano- and biotechnology.

In our conversation, we explore his group's research into the intersection of deep learning and optics, holography and computational imaging. We specifically look at a really interesting project to create all-optical neural networks which work based on diffraction, where the printed pixels of the network are analogous to neurons. We also explore some of the practical applications for their research and other areas of interest for their group.

The complete show notes for this episode can be found at twimlai.com/talk/237

Be sure to subscribe to our weekly newsletter at twimlai.com/newsletter!

Mar 07, 2019
Scaling Machine Learning on Graphs at LinkedIn with Hema Raghavan and Scott Meyer - TWiML Talk #236
47:01

Today we’re joined by Hema Raghavan and Scott Meyer of LinkedIn.

Hema is an Engineering Director Responsible for AI for Growth and Notifications, while Scott serves as a Principal Software Engineer. In this conversation, Hema, Scott and I dig into the graph database and machine learning systems that power LinkedIn features such as “People You May Know” and second-degree connections. Hema shares her insight into the motivations for LinkedIn’s use of graph-based models and some of the challenges surrounding using graphical models at LinkedIn’s scale, while Scott details his work on the software used at the company to support its biggest graph databases.

We'd like to send a huge thanks to LinkedIn for sponsoring today’s show! LinkedIn Engineering solves complex problems at scale to create economic opportunity for every member of the global workforce. AI and ML are integral aspects of almost every product the company builds for its members and customers. LinkedIn’s highly structured dataset gives their data scientists and researchers the ability to conduct applied research to improve member experiences. To learn more about the work of LinkedIn Engineering, please visit engineering.linkedin.com/blog.

For the complete show notes, visit https:/twimlai.com/talk/236. 

Mar 04, 2019
Safer Exploration in Deep Reinforcement Learning using Action Priors with Sicelukwanda Zwane - TWiML Talk #235
54:01

Today we conclude our Black in AI series with Sicelukwanda Zwane, a masters student at the University of Witwatersrand and graduate research assistant at the CSIR.

At the workshop, he presented on “Safer Exploration in Deep Reinforcement Learning using Action Priors,” which explores transferring action priors between robotic tasks to reduce the exploration space in reinforcement learning, which in turn reduces sample complexity. In our conversation, we discuss what “safer exploration” means in this sense, the difference between this work and other techniques like imitation learning, and how this fits in with the goal of “lifelong learning.”

The complete show notes for this episode can be found at https://twimlai.com/talk/235. To follow along with the Black in AI series, visit https://twimlai.com/blackinai19.

Mar 01, 2019
Dissecting the Controversy around OpenAI's New Language Model - TWiML Talk #234
01:06:22

If you’re listening to this podcast, you’ve likely seen some of the press coverage and discussion surrounding the release, or lack thereof, of OpenAI’s new GPT-2 Language Model. The announcement caused quite a stir, with reactions spanning confusion, frustration, concern, and many points in between. Several days later, many open questions remained about the model and the way the release was handled.

Seeing the continued robust discourse, and wanting to offer the community a forum for exploring this topic with more nuance than Twitter’s 280 characters allow, we convened the inaugural “TWiML Live” panel. I was joined on the panel by Amanda Askell and Miles Brundage of OpenAI, Anima Anandkumar of NVIDIA and CalTech, Robert Munro of Lilt, and Stephen Merity, the latter being some of the most outspoken voices in the online discussion of this issue.

Our discussion thoroughly explored the many issues surrounding the GPT-2 release controversy. We cover the basics like what language models are and why they’re important, and why this announcement caused such a stir, and dig deep into why the lack of a full release of the model raised concerns for so many.

The discussion initially aired via Youtube Live and we’re happy to share it with you via the podcast as well. To be clear, both the panel discussion and live stream format were a bit of an experiment for us and we’d love to hear your thoughts on it. Would you like to see, or hear, more of these TWiML Live conversations? If so, what issues would you like us to take on?

If you have feedback for us on the format or if you’d like to join the discussion around OpenAI’s GPT-2 model, head to the show notes page for this show at twimlai.com/talk/234 and leave us a comment.

Feb 25, 2019
Human-Centered Design with Mira Lane - TWiML Talk #233
47:04

Today we present the final episode in our AI for the Benefit of Society series, in which we’re joined by Mira Lane, Partner Director for Ethics and Society at Microsoft.

Mira and I focus our conversation on the role of culture and human-centered design in AI. We discuss how Mira defines human-centered design, its connections to culture and responsible innovation, and how these ideas can be scalably implemented across large engineering organizations.

We’d like to thank Microsoft for their support and their sponsorship of this series. Microsoft is committed to ensuring the responsible development and use of AI and is empowering people around the world with intelligent technology to help solve previously intractable societal challenges spanning sustainability, accessibility and humanitarian action. Learn more at Microsoft.ai.

The complete show notes for this episode can be found at twimlai.com/talk/233. For more information on the AI for the Benefit of Society series, visit twimlai.com/ai4society.

Feb 22, 2019
Fairness in Machine Learning with Hanna Wallach - TWiML Talk #232
49:04

Today we’re joined by Hanna Wallach, a Principal Researcher at Microsoft Research.

Hanna and I really dig into how bias and a lack of interpretability and transparency show up across machine learning. We discuss the role that human biases, even those that are inadvertent, play in tainting data, and whether deployment of “fair” ML models can actually be achieved in practice, and much more. Along the way, Hanna points us to a TON of papers and resources to further explore the topic of fairness in ML. You’ll definitely want to check out the notes page for this episode, which you’ll find at twimlai.com/talk/232.

We’d like to thank Microsoft for their support and their sponsorship of this series. Microsoft is committed to ensuring the responsible development and use of AI and is empowering people around the world with intelligent technology to help solve previously intractable societal challenges spanning sustainability, accessibility and humanitarian action. Learn more at Microsoft.ai.

Feb 18, 2019
AI for Healthcare with Peter Lee - TWiML Talk #231
57:19

In this episode, we’re joined by Peter Lee, Corporate Vice President at Microsoft Research responsible for the company’s healthcare initiatives.

Peter and I met a few months ago at the Microsoft Ignite conference, where he gave me some really interesting takes on AI development in China. You can find more on that topic in the show notes. This conversation centers the three impact areas Peter sees for AI in healthcare, namely diagnostics and therapeutics, tools, and the future of precision medicine. We dig into some examples in each area, and Peter details the realities of applying machine learning and some of the impediments to rapid scale.

We’d like to thank Microsoft for their support and their sponsorship of this series. Microsoft is committed to ensuring the responsible development and use of AI and is empowering people around the world with intelligent technology to help solve previously intractable societal challenges spanning sustainability, accessibility and humanitarian action. Learn more at Microsoft.ai.

The complete show notes for this episode can be found at twimlai.com/talk/231.

Feb 18, 2019
An Optimized Recurrent Unit for Ultra-Low Power Acoustic Event Detection with Justice Amoh Jr. - TWiML Talk #230
45:51

Today, we're joined by Justice Amoh Jr., a Ph.D. student at Dartmouth’s Thayer School of Engineering.

Justice presented his work on “An Optimized Recurrent Unit for Ultra-Low Power Acoustic Event Detection.” In our conversation, we discuss his goal of bringing low cost, high-efficiency wearables to market for monitoring asthma. We explore the many challenges of using classical machine learning models on microcontrollers, and how he went about developing models optimized for constrained hardware environments. We’d also like to wish Justice the best of luck as he should be defending his Ph.D. any day now!

The complete show notes for this episode can be found at https://twimlai.com/talk/230. To follow along with the Black in AI series, visit https://twimlai.com/blackinai19.

 

Feb 11, 2019
Pathologies of Neural Models and Interpretability with Alvin Grissom II - TWiML Talk #229
32:28

Today, we're excited to continue our Black in AI series with Alvin Grissom II, Assistant Professor of Computer Science at Ursinus College.

Alvin’s research is focused on computational linguistics, and we begin with a brief chat about some of his prior work on verb prediction using reinforcement learning. We then dive into the paper he presented at the workshop, “Pathologies of Neural Models Make Interpretations Difficult.” We talk through some of the “pathological behaviors” he identified in the paper, how we can better understand the overconfidence of trained deep learning models in certain settings, and how we can improve model training with entropy regularization. We also touch on the parallel between his work and the work being done on adversarial examples by Ian Goodfellow and others.

For the complete show notes, visit https://twimlai.com/talk/229. To follow along with our Black in AI series, visit https://twimlai.com/blackinai19.

 

Feb 11, 2019
AI for Earth with Lucas Joppa - TWiML Talk #228
57:07

In this episode of our AI For the Benefit of Society with Microsoft series, we’re joined by Lucas Joppa and Zach Parisa.

Lucas is the Chief Environmental Officer at Microsoft, spearheading their 5 year, $50 million AI for Earth commitment, which seeks to apply machine learning and AI across four key environmental areas: agriculture, water, biodiversity, and climate change. Zack is Co-founder and president of SilviaTerra, a Microsoft AI for Earth grantee whose mission is to help people use modern data sources to better manage forest habitats and ecosystems.

In our conversation we discuss the ways that machine learning and AI can be used to advance our understanding of forests and other ecosystems and support conservation efforts. We discuss how SilviaTerra uses computer vision and data from a wide array of sensors like LIDAR, combined with AI, to yield more detailed small-area estimates of the various species in our forests. We also briefly discuss another AI for Earth project, WildMe, a computer vision based wildlife conservation project we discussed with Jason Holmberg back on episode 166.

The complete show notes for this episode can be found at https://twimlai.com/talk/288. To follow along with the entire AI for the Benefit of Society series, visit https://twimlai.com/ai4society.

We’d like to thank Microsoft for their support and their sponsorship of this series. Microsoft is committed to ensuring the responsible development and use of AI and is empowering people around the world with intelligent technology to help solve previously intractable societal challenges spanning sustainability, accessibility and humanitarian action. Learn more at https://Microsoft.ai.

 

 

Feb 08, 2019
AI for Accessibility with Wendy Chisholm - TWiML Talk #227
51:12

Today we’re joined by Wendy Chisholm, Lois Brady, and Matthew Guggemos. Wendy is a principal accessibility architect at Microsoft, and one of the chief proponents of the AI for Accessibility program, which extends grants to AI-powered accessibility projects the areas of Employment, Daily Life, and Communication & Connection. Lois and Matthew are Co-Founders and CEO and CTO, respectively, of iTherapy, an AI for Accessibility grantee and creator of the Inner Voice app, which utilizes visual language to strengthen communication in children on the autism scale.

In our conversation, we discuss the intersection of AI and accessibility, the lasting impact that innovation in AI can have for people with disabilities and society as a whole, and the importance of programs like AI for Accessibility in bringing projects in this area to fruition. 

For the complete show notes, visit https://twimlai.com/talk/226.

The transcript for this interview can be found at https://twimlai.com/talk/206/tx.

To follow along with the AI for the Benefit of Society series, visit https://twimlai.com/ai4society.

Thanks to Microsoft for their support and their sponsorship of this series. Microsoft is committed to ensuring the responsible development and use of AI and is empowering people around the world with intelligent technology to help solve previously intractable societal challenges spanning sustainability, accessibility and humanitarian action. Learn more at https://microsoft.ai.

 

Feb 06, 2019
AI for Humanitarian Action with Justin Spelhaug - TWiML Talk #226
59:21

Today we're joined by Justin Spelhaug, General Manager of Technology for Social Impact at Microsoft.

In our conversation, Justin and I discuss the company’s efforts in AI for Humanitarian Action, a program which extends grants to fund AI-powered projects focused on disaster response, the needs of children, protecting refugees, and promoting respect for human rights. We cover Microsoft’s overall approach to technology for social impact, how his group helps mission-driven organizations best leverage technologies like AI, and how AI is being used at places like the World Bank, Operation Smile, and Mission Measurement to create greater impact.

The complete show notes for this episode can be found at https://twimlai.com/talk/226. Follow along with the entire AI for the Benefit of Society series, visit https://twimlai.com/ai4society.

We’d like to thank Microsoft for their support of the show, and their sponsorship of this series.  Microsoft is committed to ensuring the responsible development and use of AI and is empowering people around the world with this intelligent technology to help solve previously intractable societal challenges spanning sustainability, accessibility and humanitarian action. Learn more about their plan at Microsoft.ai

Feb 04, 2019
Teaching AI to Preschoolers with Randi Williams - TWiML Talk #225
44:32

Today, in the first episode of our Black in AI series, we’re joined by Randi Williams, PhD student at the MIT Media Lab.

At the Black in AI workshop Randi presented her research on Popbots: A Early Childhood AI Curriculum, which is geared towards teaching preschoolers the fundamentals of artificial intelligence. In our conversation, we discuss the origins of the project, the three AI concepts that are taught in the program, and the goals that Randi hopes to accomplish with her work. This was a fun conversation!

The complete show notes for this episode can be found at twimlai.com/talk/225.

Follow along with our Black in AI series at twimlai.com/blackinai19.

Jan 31, 2019
Holistic Optimization of the LinkedIn News Feed - TWiML Talk #224
48:24

Today we’re joined by Tim Jurka, Head of Feed AI at LinkedIn.

As you can imagine Feed AI is responsible for curating all the content you see daily on the LinkedIn site. What’s less apparent to those that don’t work on this type of product is the wide variety of opposing factors that need to be considered in organizing the feed. As you’ll learn in our conversation, Tim calls this the holistic optimization of the feed and we discuss some of the interesting technical and business challenges associated with trying to do this. We talk through some of the specific techniques used at LinkedIn like Multi-arm Bandits and Content Embeddings, and also jump into a really interesting discussion about organizing for machine learning at scale.

We’d like to send a huge thanks to LinkedIn for sponsoring today’s show! LinkedIn Engineering solves complex problems at scale to create economic opportunity for every member of the global workforce. AI and ML are integral aspects of almost every product the company builds for its members and customers. LinkedIn’s highly structured dataset gives their data scientists and researchers the ability to conduct applied research to improve member experiences. To learn more about the work of LinkedIn Engineering, please visit https://engineering.linkedin.com/blog.

The complete show notes can be found at https://twimlai.com/talk/224.

Jan 28, 2019
AI at the Edge at Qualcomm with Gary Brotman - TWiML Talk #223
51:54

Today we’re joined by Gary Brotman, Senior Director of Product Management at Qualcomm Technologies, Inc.

Gary, who got his start in AI through music, now leads strategy and product planning for the company’s Artificial Intelligence and Machine Learning technologies, including those that make up the Qualcomm Snapdragon mobile platforms. In our conversation, we discuss AI on mobile devices and at the edge, including popular use cases, and explore some of the various acceleration technologies offered by Qualcomm and others that enable them. We also dig into the state of AI on devices from the application developer’s perspective, and how various acceleration technologies fit together to help developers bring new products to market.

Thanks to our friends at Qualcomm for sponsoring today’s show! As you’ll hear in the conversation with Gary, Qualcomm has been in the AI space for well over a decade now, powering some of the latest and greatest Android devices with their Snapdragon chipset. With their strong footing in the mobile chipset space, Qualcomm now has the goal of making AI at the edge ubiquitous, beyond mobile devices. To find out more about what they’re up to, and how they plan to get there, visit twimlai.com/qualcomm.

The complete show notes for this episode can be found at twimlai.com/talk/223.

Jan 24, 2019
AI Innovation at CES - TWiML Talk #222
02:01

A few weeks ago, I made the trek to Las Vegas for the world’s biggest electronics conference, CES.

CES is one of those things that’s hard to fully understand without having seen, so I thought it’d be fun to give you a look at it from my vantage point. In this special visual episode, we’re going to check out some of the interesting examples of machine learning and AI that I found at the event. We cover a bunch of different categories, including several that don’t really target consumers at all, like John Deere’s gigantic, combine harvester, a company building a drone that stops bullets, and a startup that wants to do away with something we all despise, traffic.

Check out the video at https://twimlai.com/ces2019, and be sure to hit the like and subscribe buttons and let us know how you like the show via a comment!

For the show notes, visit https://twimlai.com/talk/222.

 

Jan 21, 2019
Self-Tuning Services via Real-Time Machine Learning with Vladimir Bychkovsky - TWiML Talk #221
46:06

Today we’re joined by Vladimir Bychkovsky, Engineering Manager at Facebook, to discuss Spiral.

Spiral is a system they’ve developed for self-tuning high-performance infrastructure services at scale, using real-time machine learning. In our conversation, we explore the ins and outs of Spiral, including how the system works, how it was developed, and how infrastructure teams at Facebook can use it to replace hand-tuned parameters set using heuristics with services that automatically optimize themselves in minutes rather than in weeks. We also discuss the challenges of implementing these kinds of systems, overcoming user skepticism, and achieving an appropriate level of explainability.

The complete show notes for this episode can be found at twimlai.com/talk/221

 

Jan 17, 2019
Building a Recommender System from Scratch at 20th Century Fox with JJ Espinoza - TWiML Talk #220
35:08

Today we’re joined by JJ Espinoza, former Director of Data Science at 20th Century Fox.

In this talk we start out with a discussion JJ’s transition from econometrician to data scientist, and then dig into his and his team’s experience building and deploying a content recommendation system from the ground up. In our conversation, we explore the design of a couple of key components of their system, the first of which processes movie scripts to make recommendations about which movies the studio should make, and the second processes trailers to determine which should be recommended to users. We discuss the challenges they’ve encountered fielding these systems, some of the tools that were used along the way, and a few of the upcoming projects that could be layered on top of the platform they’ve built.

For the complete show notes for this episode, visit twimlai.com/talk/220.

If this talk piqued your interest, you should also check out Talk #201, where Leemay Nassery of Comcast breaks down how she led the rebuild of the Comcast Xfinity X1 recommender platform.

 

Jan 14, 2019
Legal and Policy Implications of Model Interpretability with Solon Barocas - TWiML Talk #219
47:00

Today we’re joined by Solon Barocas, Assistant Professor of Information Science at Cornell University.

Solon is also the co-founder of the Fairness, Accountability, and Transparency in Machine Learning workshop that is hosted annually at conferences like ICML. Solon and I caught up to discuss his work on model interpretability and the legal and policy implications of the use of machine learning models. In our conversation, we discuss the gap between law, policy, and ML, and how to build the bridge between them, including formalizing ethical frameworks for machine learning. We also look at his paper ”The Intuitive Appeal of Explainable Machines,” which proposes that explainability is really two problems, inscrutability and non-intuitiveness, and that disentangling the two allows us to better reason about the kind of explainability that’s really needed in any given situation.

The complete show notes for this episode can be found at https://twimlai.com/talk/219.

And be sure to sign up for our weekly newsletter at https://twimlai.com/newsletter! 

 

Jan 10, 2019
Trends in Computer Vision with Siddha Ganju - TWiML Talk #218
01:11:01

In the final episode of our AI Rewind series, we’re excited to have Siddha Ganju back on the show.

Siddha, who is now an autonomous vehicles solutions architect at Nvidia shares her thoughts on trends in Computer Vision in 2018 and beyond. We cover her favorite CV papers of the year in areas such as neural architecture search, learning from simulation, application of CV to augmented reality, and more, as well as a bevy of tools and open source projects.

The complete show notes for this episode can be found at https://twimlai.com/talk/218

For more information on our AI Rewind series, visit https://twimlai.com/rewind18.

Jan 07, 2019
Trends in Reinforcement Learning with Simon Osindero - TWiML Talk #217
52:46

In this episode of our AI Rewind series, we introduce a new friend of the show, Simon Osindero, Staff Research Scientist at DeepMind.

We discuss trends in Deep Reinforcement Learning in 2018 and beyond. We’ve packed a bunch into this show, as Simon walks us through many of the important papers and developments seen last year in areas like Imitation Learning, Unsupervised RL, Meta-learning, and more.

The complete show notes for this episode can be found at https://twimlai.com/talk/217.

For more information on our 2018 AI Rewind series, visit https://twimlai.com/rewind2018.

 

 

Jan 03, 2019
Trends in Natural Language Processing with Sebastian Ruder - TWiML Talk #216
53:32

In this episode of our AI Rewind series, we’ve brought back recent guest Sebastian Ruder, PhD Student at the National University of Ireland and Research Scientist at Aylien, to discuss trends in Natural Language Processing in 2018 and beyond.

In our conversation we cover a bunch of interesting papers spanning topics such as pre-trained language models, common sense inference datasets and large document reasoning and more, and talk through Sebastian’s predictions for the new year.

The complete show notes for this episode can be found at twimlai.com/talk/216.

For more information on the AI Rewind 2018 series, visit twimlai.com/rewind18.

Dec 31, 2018
Trends in Machine Learning with Anima Anandkumar - TWiML Talk #215
51:54

In this episode of our AI Rewind series, we’re back with Anima Anandkumar, Bren Professor at Caltech and now Director of Machine Learning Research at NVIDIA.

Anima joins us to discuss her take on trends in the broader Machine Learning field in 2018 and beyond. In our conversation, we cover not only technical breakthroughs in the field but also those around inclusivity and diversity.

For this episode's complete show notes, visit twimlai.com/talk/215.

For more information on the AI Rewind series, visit twimlai.com/rewind18.

Dec 27, 2018
Trends in Deep Learning with Jeremy Howard - TWiML Talk #214
01:08:47

In this episode of our AI Rewind series, we’re bringing back one of your favorite guests of the year, Jeremy Howard, founder and researcher at Fast.ai.

Jeremy joins us to discuss trends in Deep Learning in 2018 and beyond. We cover many of the papers, tools and techniques that have contributed to making deep learning more accessible than ever to so many developers and data scientists.

The complete show notes for this episode can be found at https://twimlai.com/talk/214.

Follow along with our AI Rewind 2018 series visit https://twimlai.com/rewind18

Dec 24, 2018
Training Large-Scale Deep Nets with RL with Nando de Freitas - TWiML Talk #213
55:17

Today we close out both our NeurIPS series and our 2018 conference coverage with this interview with Nando de Freitas, Team Lead & Principal Scientist at Deepmind and Fellow at the Canadian Institute for Advanced Research.

In our conversation, we explore his interest in understanding the brain and working towards artificial general intelligence through techniques like meta-learning, few-shot learning and imitation learning. In particular, we dig into a couple of his team’s NeurIPS papers: “Playing hard exploration games by watching YouTube,” and “One-Shot high-fidelity imitation: Training large-scale deep nets with RL.”

The complete show notes for this episode can be found at https://twimlai.com/talk/213.

For more information on the NeurIPS series, visit https://twimlai.com/neurips2018.

 

Dec 20, 2018
Making Algorithms Trustworthy with David Spiegelhalter - TWiML Talk #212
23:26

In this, the second episode of our NeurIPS series, we’re joined by David Spiegelhalter, Chair of Winton Center for Risk and Evidence Communication at Cambridge University and President of the Royal Statistical Society.

David, an invited speaker at NeurIPS, presented on “Making Algorithms Trustworthy: What Can Statistical Science Contribute to Transparency, Explanation and Validation?”. In our conversation, we explore the nuanced difference between being trusted and being trustworthy, and its implications for those building AI systems. We also dig into how we can evaluate trustworthiness, which David breaks into four phases, the inspiration for which he drew from British philosopher Onora O'Neill's ideas around 'intelligent transparency’.

The complete show notes for this episode can be found at twimlai.com/talk/212.

For more information on the NeurIPS series, visit twimlai.com/neurips2018.

Dec 20, 2018
Designing Computer Systems for Software with Kunle Olukotun - TWiML Talk #211
56:32

Today we’re joined by Kunle Olukotun, Professor in the department of Electrical Engineering and Computer Science at Stanford University, and Chief Technologist at Sambanova Systems.

Kunle was an invited speaker at NeurIPS this year, presenting on “Designing Computer Systems for Software 2.0.” In our conversation, we discuss various aspects of designing hardware systems for machine and deep learning, touching on multicore processor design, domain specific languages, and graph-based hardware. We cover the limitations of the current hardware such as GPUs, and peer a bit into the future as well. This was a fun one!

The complete show notes for this episode can be found at twimlai.com/talk/211

For more information on this series, visit twimlai.com/neurips2018.

Dec 18, 2018
Operationalizing Ethical AI with Kathryn Hume - TWiML Talk #210
54:28

Today we conclude our Trust in AI series with this conversation with Kathryn Hume, VP of Strategy at Integrate AI.

You might remember Kathryn from our interview last year on “Selling AI to the Enterprise,” which was TWiML Talk #20. This time around, we discuss her newly released white paper “Responsible AI in the Consumer Enterprise,” which details a framework for ethical AI deployment in e-commerce companies and other consumer-facing enterprises. We look at the structure of the ethical framework she proposes, and some of the many questions that need to be considered when deploying AI in an ethical manner.

For the complete show notes for this episode, visit twimlai.com/talk/210.

 

Dec 14, 2018
Approaches to Fairness in Machine Learning with Richard Zemel - TWiML Talk #209
46:12