About this episode
Artificial intelligence is the automation of tasks that require human intelligence, encompassing fields like natural language processing, perception, planning, and robotics, with machine learning emerging as the primary method to recognize patterns in data and make predictions. Data science serves as the overarching discipline that includes artificial intelligence and machine learning, focusing broadly on extracting knowledge and actionable insights from data using scientific and computational methods.
Links
Data Science Overview
- Data science encompasses any professional role that deals extensively with data, including but not limited to artificial intelligence and machine learning.
- The data science pipeline includes data ingestion, storage, cleaning (feature engineering), and outputs in data analytics, business intelligence, or machine learning.
- A data lake aggregates raw data from multiple sources, while a feature store holds cleaned and transformed data, prepared for analysis or model training.
- Data analysts and business intelligence professionals work primarily with data warehouses to generate human-readable reports, while machine learning engineers use transformed data to build and deploy predictive models.
- At smaller organizations, one person ("data scientist") may perform all data pipeline roles, whereas at large organizations, each phase may be specialized.
- Wikipedia: Data Science describes data science as the interdisciplinary field for extracting knowledge and insights from structured and unstructured data.
Artificial Intelligence: Definition and Sub-disciplines
- Artificial intelligence (AI) refers to the theory and development of computer systems capable of performing tasks that typically require human intelligence, such as visual perception, speech recognition, decision-making, and language translation. (Wikipedia: Artificial Intelligence)
- The AI discipline is divided into subfields:
- Reasoning and problem solving
- Knowledge representation (such as using ontologies or knowledge graphs)
- Planning (selecting actions in an environment, e.g., chess- or Go-playing bots, self-driving cars)
- Learning
- Natural language processing (simulated language, machine translation, chatbots, speech recognition, question answering, summarization)
- Perception (AI perceives the world with sensors; e.g., cameras, microphones in self-driving cars)
- Motion and manipulation (robotics, transforming decisions into physical actions via actuators)
- Social intelligence (AI tuned to human emotions, sentiment analysis, emotion recognition)
- General intelligence (Artificial General Intelligence, or AGI: a system that generalizes across all domains at or beyond human skill)
- Applications of AI include autonomous vehicles, medical diagnosis, creating art, proving theorems, playing strategy games, search engines, digital assistants, image recognition, spam filtering, judicial decision prediction, and targeted online advertising.
- AI has both objective definitions (automation of intellectual tasks) and subjective debates around the threshold for "intelligence."
- The Turing Test posits that if a human cannot distinguish an AI from another human through conversation, the AI can be considered intelligent.
- Weak AI targets specific domains, while general AI aspires to domain-independent capability.
- AlphaGo Movie depicts the use of AI planning and learning in the game of Go.
Machine Learning: Within AI
- Machine learning (ML) is a subdiscipline of AI focused on building models that learn patterns from data and make predictions or decisions. (Wikipedia: Machine Learning)
- Machine learning involves feeding data (such as spreadsheets of stock prices) into algorithms that detect patterns (learning phase) and generate models, which are then used to predict future outcomes.
- Although ML started as a distinct subfield, in recent years it has subsumed many of the original AI subdisciplines, becoming the primary approach in areas like natural language processing, computer vision, reasoning, and planning.
- Deep learning has driven this shift, employing techniques such as neural networks, convolutional networks (image processing), and transformers (language tasks), allowing generalizable solutions across multiple domains.
- Reinforcement learning, a form of machine learning, enables AI systems to learn sequences of actions in complex environments, such as games or real-world robotics, by maximizing cumulative rewards.
- Modern unified ML models, such as Google's Pathways and transformer architectures, can now tackle tasks in multiple subdomains (vision, language, decision-making) with a single framework.
Data Pipeline and Roles in Data Science
- Data engineering covers obtaining and storing raw data from various data sources (datasets, databases, streams), aggregating into data lakes, and applying schema or permissions.
- Feature engineering cleans and transforms raw data (imputation, feature transformation, selection) for machine learning or analytics.
- Data warehouses store column-oriented, recent slices of data optimized for fast querying and are used by analysts and business intelligence professionals.
- The analytics branch (data analysts, BI professionals) uses cleaned, curated data to generate human insights and reports.
- Data analysts apply technical and coding skills, while BI professionals often use specialized tools (e.g., Tableau, Power BI).
- The machine learning branch uses feature data to train predictive models, automate decisions, and in some cases, trigger actions (robots, recommender systems).
- The role of a "data scientist" can range from specialist to generalist, depending on team size and industry focus.
Historical Context of Artificial Intelligence
- Early concepts of artificial intelligence appear in Greek mythology (automatons) and Jewish mythology (Golems).
- Ramon Lull in the 13th century and Leonardo da Vinci constructed early automatons.
- Contributions:
- Thomas Bayes (probability inference, 1700s)
- George Boole (logical reasoning, binary algebra)
- Gottlob Frege (propositional logic)
- Charles Babbage and Ada Byron/Lovelace (Analytical Engine, 1832)
- Alan Turing (Universal Turing Machine, 1936; foundational ideas on computing and AI)
- John von Neumann (Universal Computing Machine, 1946)
- Warren McCulloch, Walter Pitts, Frank Rosenblatt (artificial neurons, perceptron, foundation of connectionist/neural net models)
- John McCarthy, Marvin Minsky, Arthur Samuel, Oliver Selfridge, Ray Solomonoff, Allen Newell, Herbert Simon (Dartmouth Workshop, 1956: "AI" coined)
- Newell and Simon (Heuristics, General Problem Solver)
- Feigenbaum (expert systems)
- GOFAI/symbolism (logic- and knowledge-based systems)
- The "AI winter" followed the Lighthill report (1970s) due to overpromising and slow real-world progress.
- AI resurgence in the 1990s was fueled by advances in computation, increased availability of data (the era of "big data"), and improvements in neural network methodologies (notably Geoffrey Hinton's optimization of backpropagation in 2006).
- The 2010s saw dramatic progress, with companies such as DeepMind (acquired by Google in 2014) achieving state-of-the-art results in reinforcement learning and general AI research.
- The Sub-disciplines of AI and other resources:
Further Learning Resources
Feb 2017
MLG 003 Inspiration
<div> <p>AI is rapidly transforming both creative and knowledge-based professions, prompting debates on economic disruption, the future of work, the singularity, consciousness, and the potential risks associated with powerful autonomous systems. Philosophical discussions now focu ... Show More
19m 9s
Feb 2017
MLG 004 Algorithms - Intuition
<div> <p>Machine learning consists of three steps: prediction, error evaluation, and learning, implemented by training algorithms on large datasets to build models that can make decisions or classifications. The primary categories of machine learning algorithms are supervised, un ... Show More
23m 27s
Feb 2017
MLG 005 Linear Regression
<div> <p>Linear regression is introduced as the foundational supervised learning algorithm for predicting continuous numeric values, using cost estimation of Portland houses as an example. The episode explains the three-step process of machine learning - prediction via a hypothes ... Show More
34m 29s
May 2023
The AI Will See You Now: Exploring Biomedical AI and Google’s Med-PaLM2 With Karan Singhal
What if AI could revolutionize healthcare with advanced language learning models? Sarah and Elad welcome Karan Singhal, Staff Software Engineer at Google Research, who specializes in medical AI and the development of MedPaLM2. On this episode, Karan emphasizes the importance of s ... Show More
42m 48s
Dec 2024
The TED AI Show: How AI robots learn just like babies — but a million times faster w/ NVIDIA’s Rev Lebaredian
<p>Computers have been outperforming humans for years on tasks like solving complex equations or analyzing data, but when it comes to the physical world, robots struggle to keep up. It can take years to train robots to function in the messy chaos of the “real world” — but thanks ... Show More
53m 27s
Oct 2024
Never Too Much AI: Upwork's Andrew Rabinovich
Andrew Rabinovich began his career in technology working on AI applications for cancer detection. He also spent time at Google, working on early iterations of products like Google Glass. Now at Upwork, as vice president and head of AI and machine learning, Rabinovich and his team ... Show More
35m 37s
Nov 2024
SE Radio 641: Catherine Nelson on Machine Learning in Data Science
<p><strong>Catherine Nelson</strong>, author of the new O'Reilly book, <em data-renderer-mark="true">Software Engineering for Data Scientists</em>, discusses the collaboration between data scientists and software engineers -- an increasingly common pairing on machine learning and ... Show More
48m 19s
Apr 2024
Measuring The Speed of AI Through Benchmarks
<p dir="ltr">David Kanter, Executive Director at MLCommons, discusses the work they're doing with MLPerf Benchmarks, creating the world's first industry standard approach to measuring AI speed and safety. He also shares ways they're testing AI and LLMs for harm, to measure—and, o ... Show More
31m 45s
Jun 2024
#218 Designing AI Applications with Robb Wilson, Co-Founder & CEO at Onereach.ai
All the hype around generative AI means that every software maker seems to be stuffing chat interfaces into their products whenever they can. For the most part, the jury is still out on whether this is a good idea or not. However, design goes deeper than just the user interface, ... Show More
46m 36s
Dec 2024
The TED AI Show: Could AI really achieve consciousness? w/ neuroscientist Anil Seth
<p>Human brains are often described as computers — machines that are “wired” to make decisions and respond to external stimuli in a way that’s not so different from the artificial intelligence that we increasingly use each day. But the difference between our brains and the comput ... Show More
56m 51s
Apr 2024
777: Generative AI in Practice, with Bernard Marr
Generative AI is reshaping our world, and Bernard Marr, world-renowned futurist and best-selling author, joins Jon Krohn to guide us through this transformation. In this episode, Bernard shares his insights on how AI is transforming industries, revolutionizing daily life, and add ... Show More
1h 8m
Dec 2024
BI 201 Rajesh Rao: From Predictive Coding to Brain Co-Processors
<p class="has-text-align-center"><strong><a href="https://www.patreon.com/braininspired">Support the show</a> to get full episodes, full archive, and join the Discord community</strong>.</p>
<a href="https://www.patreon.com/braininspired"></a>
<p>Today I'm in conversati ... Show More
1h 37m