Light

6.1 AI and ML fundamentals

12 min read•august 20, 2024

AI and ML are transforming industries, enabling automation and data-driven insights. These technologies are key to digital transformation, helping organizations create value and gain a competitive edge. Understanding AI and ML fundamentals is crucial for developing effective strategies.

AI encompasses various approaches, including rule-based systems and . ML, a subset of AI, allows computers to learn from data without explicit programming. , using , has achieved breakthroughs in areas like image recognition and natural language processing.

Defining AI and ML

(AI) and Machine Learning (ML) are key drivers of digital transformation, enabling organizations to automate processes, gain insights from data, and create intelligent systems
Understanding the fundamentals of AI and ML is essential for developing effective digital transformation strategies that leverage these technologies to create value and competitive advantage

AI as a broad concept

Top images from around the web for AI as a broad concept

Explainer: Autonomous and Semi-autonomous vehicles – Ned Hayes View original
Is this image relevant?
Frontiers | Automotive Intelligence Embedded in Electric Connected Autonomous and Shared ... View original
Is this image relevant?
Artificial Intelligence View original
Is this image relevant?
Explainer: Autonomous and Semi-autonomous vehicles – Ned Hayes View original
Is this image relevant?
Frontiers | Automotive Intelligence Embedded in Electric Connected Autonomous and Shared ... View original
Is this image relevant?

1 of 3

Top images from around the web for AI as a broad concept

Explainer: Autonomous and Semi-autonomous vehicles – Ned Hayes View original
Is this image relevant?
Frontiers | Automotive Intelligence Embedded in Electric Connected Autonomous and Shared ... View original
Is this image relevant?
Artificial Intelligence View original
Is this image relevant?
Explainer: Autonomous and Semi-autonomous vehicles – Ned Hayes View original
Is this image relevant?
Frontiers | Automotive Intelligence Embedded in Electric Connected Autonomous and Shared ... View original
Is this image relevant?

1 of 3

AI refers to the development of computer systems that can perform tasks that typically require human intelligence (visual perception, speech recognition, decision-making, language translation)
AI encompasses various approaches and technologies, including rule-based systems, expert systems, and machine learning
AI systems can be narrow (focused on specific tasks) or general (capable of performing a wide range of tasks)
Examples of AI applications include virtual assistants (Siri, Alexa), autonomous vehicles, and intelligent recommendation systems

Machine learning as a subset of AI

Machine learning is a subset of AI that focuses on enabling computers to learn and improve from experience without being explicitly programmed
ML algorithms build mathematical models based on sample data () to make predictions or decisions without being explicitly programmed to do so
ML systems can adapt and improve their performance over time as they are exposed to more data
Examples of ML applications include spam filters, fraud detection systems, and image recognition algorithms

Deep learning and neural networks

Deep learning is a subfield of machine learning that uses artificial neural networks with multiple layers (deep neural networks) to learn hierarchical representations of data
Neural networks are inspired by the structure and function of the human brain, consisting of interconnected nodes (neurons) organized in layers
Deep learning has achieved breakthrough performance in areas such as image , natural language processing, and speech recognition
Examples of deep learning applications include facial recognition systems, language translation services, and self-driving cars

Key differences between AI and ML

Understanding the distinctions between AI and ML is crucial for developing effective digital transformation strategies that align with organizational goals and capabilities
Comparing rule-based vs learning systems and exploring different types of learning approaches helps organizations choose the most appropriate AI/ML techniques for their specific use cases

Comparing rule-based vs learning systems

Rule-based AI systems rely on explicit, hand-crafted rules to make decisions or solve problems (if-then statements, )
Learning systems, such as ML algorithms, automatically learn patterns and relationships from data without being explicitly programmed
Rule-based systems are deterministic and transparent but can be brittle and difficult to scale, while learning systems are more flexible and scalable but can be opaque and harder to interpret
Examples of rule-based systems include expert systems for medical diagnosis, while learning systems include recommendation engines and predictive maintenance algorithms

Exploring supervised vs unsupervised learning

involves training ML models on labeled data, where the desired output is known (input-output pairs)
involves discovering patterns and structures in unlabeled data without predefined output labels
Supervised learning is used for prediction and classification tasks (spam filters, credit risk assessment), while unsupervised learning is used for clustering and anomaly detection (customer segmentation, fraud detection)
combines labeled and unlabeled data to improve model performance when labeled data is scarce or expensive to obtain

Classification vs regression in ML

Classification is a supervised learning task that involves predicting discrete class labels or categories (binary or multiclass)
is a supervised learning task that involves predicting continuous numerical values (stock prices, housing prices)
Classification algorithms include logistic regression, decision trees, and , while regression algorithms include linear regression, polynomial regression, and gradient boosting
Examples of classification tasks include email spam detection and image recognition, while regression tasks include sales forecasting and demand prediction

Essential components of ML systems

Building effective ML systems requires a deep understanding of the key components and best practices involved in the ML development process
Focusing on data quality, , and proper dataset splitting is essential for creating robust and reliable ML models that drive successful digital transformation initiatives

Importance of data in ML

Data is the foundation of ML systems, as models learn patterns and relationships from the data they are trained on
The quality, quantity, and representativeness of the training data directly impact the performance and generalization ability of ML models
Data collection, preprocessing, and augmentation techniques (data cleaning, normalization, feature scaling) are crucial for preparing data for ML tasks
Examples of data sources for ML include structured databases, unstructured text, images, and sensor data from IoT devices

Feature selection and engineering

Feature selection involves identifying the most informative and relevant features (input variables) for a given ML task
Feature engineering involves creating new features or transforming existing ones to improve the performance of ML models
Techniques for feature selection include statistical tests (chi-squared, ANOVA), correlation analysis, and recursive feature elimination
Feature engineering techniques include one-hot encoding for categorical variables, feature scaling (normalization, standardization), and domain-specific transformations (log transformation, polynomial features)

Training, validation and test datasets

ML datasets are typically split into three subsets: training, validation, and test sets
The training set is used to fit the model parameters and learn patterns from the data
The validation set is used to tune hyperparameters and assess model performance during development
The test set is used to evaluate the final model's performance on unseen data and estimate its generalization ability
Proper dataset splitting (stratified sampling, cross-validation) is essential to avoid overfitting and ensure reliable model evaluation

Common ML algorithms and models

Familiarity with a diverse range of ML algorithms and models is essential for selecting the most appropriate techniques for specific digital transformation use cases
Understanding the strengths, weaknesses, and underlying assumptions of each algorithm helps organizations make informed decisions when developing ML-driven solutions

Decision trees and random forests

Decision trees are hierarchical models that make predictions by recursively splitting the input space based on feature values
are ensemble models that combine multiple decision trees to improve prediction and reduce overfitting
Decision trees are interpretable and can handle both categorical and numerical features, but can be prone to overfitting
Random forests are more robust and generally achieve higher accuracy, but are less interpretable than individual decision trees
Examples of applications include credit risk assessment, customer churn prediction, and disease diagnosis

Support vector machines (SVMs)

SVMs are discriminative models that find the optimal hyperplane to separate classes in high-dimensional feature spaces
SVMs can handle non-linearly separable data by using kernel functions (polynomial, RBF) to transform the input space
SVMs are effective for binary classification tasks and can handle high-dimensional data, but are sensitive to hyperparameter tuning
Applications of SVMs include text classification, image recognition, and bioinformatics (protein structure prediction)

Naive Bayes classifiers

classifiers are probabilistic models that make predictions based on Bayes' theorem and the assumption of feature independence
Naive Bayes classifiers are computationally efficient and can handle high-dimensional data, but the independence assumption may not hold in practice
Variants of Naive Bayes include Gaussian Naive Bayes (continuous features), Multinomial Naive Bayes (discrete features), and Bernoulli Naive Bayes (binary features)
Applications of Naive Bayes include spam filtering, sentiment analysis, and document classification

K-nearest neighbors (KNN)

KNN is a non-parametric algorithm that makes predictions based on the majority class or average value of the K nearest neighbors in the feature space
KNN is simple to implement and can handle multi-class classification and regression tasks, but can be computationally expensive for large datasets
The choice of K and the distance metric (Euclidean, Manhattan, Minkowski) can significantly impact the performance of KNN
Applications of KNN include recommendation systems, anomaly detection, and image classification

Neural networks and deep learning

Neural networks and deep learning have revolutionized various domains by enabling the learning of complex, hierarchical representations from raw data
Understanding the fundamentals of neural networks and popular architectures is crucial for leveraging deep learning in digital transformation initiatives

Basics of artificial neural networks

Artificial neural networks are composed of interconnected nodes (neurons) organized in layers (input, hidden, output)
Each neuron computes a weighted sum of its inputs, applies an activation function (sigmoid, ReLU, tanh), and passes the output to the next layer
Neural networks learn by adjusting the weights of the connections through backpropagation and optimization algorithms (gradient descent, Adam)
Key concepts in neural networks include , (cross-entropy, mean squared error), and (L1/L2, dropout)

Convolutional neural networks (CNNs)

CNNs are specialized neural networks designed for processing grid-like data, such as images and time series
CNNs use convolutional layers to learn local patterns and features, and pooling layers to reduce spatial dimensions and introduce translation invariance
CNNs have achieved state-of-the-art performance in tasks such as image classification, object detection, and semantic segmentation
Applications of CNNs include facial recognition, medical image analysis, and autonomous vehicles

Recurrent neural networks (RNNs)

RNNs are neural networks designed for processing sequential data, such as text, speech, and time series
RNNs have recurrent connections that allow them to maintain a hidden state and capture long-term dependencies in the input sequence
Variants of RNNs, such as Long Short-Term Memory (LSTM) and Gated Recurrent Units (GRU), address the vanishing gradient problem and improve the learning of long-range dependencies
Applications of RNNs include language modeling, machine translation, and speech recognition

Generative adversarial networks (GANs)

GANs are a class of deep learning models that consist of two neural networks: a generator and a discriminator
The generator learns to create realistic samples (images, text, audio) from random noise, while the discriminator learns to distinguish between real and generated samples
GANs are trained through an adversarial process, where the generator and discriminator compete to improve their respective abilities
Applications of GANs include image synthesis, style transfer, and data augmentation for ML tasks

ML development process and lifecycle

Developing successful ML solutions requires a structured approach that covers all stages of the ML lifecycle, from problem definition to deployment and maintenance
Understanding the key steps and best practices in the ML development process is essential for ensuring the reliability, scalability, and maintainability of ML-driven digital transformation initiatives

Problem definition and data collection

Clearly defining the problem and identifying the business objectives is the first step in the ML development process
Data collection involves gathering relevant and representative data from various sources (databases, APIs, sensors) to support the ML task
Data quality assessment and data exploration techniques (summary statistics, visualizations) help identify potential issues and gain insights into the data
Establishing data pipelines and storage solutions (data lakes, data warehouses) is crucial for ensuring a reliable and scalable data infrastructure

Data preprocessing and cleaning

Data preprocessing involves transforming raw data into a suitable format for ML algorithms
Common preprocessing steps include handling missing values (imputation, deletion), encoding categorical variables (one-hot encoding, label encoding), and scaling numerical features (normalization, standardization)
Data cleaning techniques (outlier detection, deduplication) help identify and remove or correct erroneous or inconsistent data points
Feature selection and engineering techniques are applied to select the most informative features and create new features that improve model performance

Model selection and hyperparameter tuning

Model selection involves choosing the most appropriate ML algorithm or architecture for the given problem and data
Factors to consider in model selection include the type of problem (classification, regression), the size and structure of the data, and the interpretability and computational requirements of the model
Hyperparameter tuning is the process of finding the optimal values for the model's hyperparameters (learning rate, regularization strength, number of hidden layers) to maximize performance
Techniques for hyperparameter tuning include grid search, random search, and Bayesian optimization

Deployment, monitoring and maintenance

Deployment involves integrating the trained ML model into a production environment, such as a web application or a mobile app
Containerization technologies (Docker, Kubernetes) and serverless platforms (AWS Lambda, Google Cloud Functions) facilitate the scalable and efficient deployment of ML models
Monitoring the performance and behavior of deployed ML models is essential for detecting issues and ensuring the models remain accurate and reliable over time
Techniques for monitoring ML models include tracking performance metrics (accuracy, , recall), monitoring data drift, and setting up alerts for anomalies or degraded performance
Maintenance activities include retraining models on new data, updating model architectures, and addressing security vulnerabilities or performance bottlenecks

Challenges and limitations of AI/ML

While AI and ML offer significant opportunities for digital transformation, organizations must also be aware of the challenges and limitations associated with these technologies
Addressing issues related to bias, interpretability, concept drift, and ethics is crucial for developing responsible and trustworthy AI/ML solutions

Bias and fairness in ML models

ML models can inherit biases present in the training data, leading to unfair or discriminatory outcomes (gender bias, racial bias)
Sources of bias include unrepresentative or imbalanced datasets, biased data collection processes, and human biases embedded in the data
Techniques for mitigating bias include using diverse and representative datasets, applying fairness constraints during training, and conducting bias audits on trained models
Ensuring fairness in ML models is essential for building trust and avoiding unintended consequences, particularly in sensitive domains such as hiring, lending, and criminal justice

Interpretability and explainability

Many ML models, particularly deep learning models, are complex and opaque, making it difficult to understand how they make predictions or decisions
Lack of interpretability can hinder the adoption of ML models in high-stakes domains (healthcare, finance) where and accountability are critical
Techniques for improving interpretability include using inherently interpretable models (decision trees, linear models), applying post-hoc explanation methods (LIME, SHAP), and developing explainable AI (XAI) frameworks
Explainable AI aims to provide human-understandable explanations for the behavior and decisions of ML models, enhancing trust and facilitating debugging and improvement

Handling concept drift and model decay

Concept drift refers to the change in the underlying data distribution or the relationship between inputs and outputs over time
Model decay occurs when the performance of a deployed ML model degrades due to concept drift or changes in the environment
Techniques for handling concept drift include continuous monitoring of model performance, updating models with new data, and using adaptive learning algorithms (online learning, incremental learning)
Strategies for mitigating model decay include regular retraining, using ensemble models, and implementing automated model management and versioning systems

Ethical considerations in AI/ML

The development and deployment of AI/ML systems raise ethical concerns related to privacy, security, transparency, and accountability
Ethical issues include data privacy and consent, and discrimination, job displacement, and the potential misuse of AI for malicious purposes
Addressing ethical considerations requires a multidisciplinary approach involving technical, legal, and social perspectives
Best practices for ethical AI/ML include establishing ethical guidelines and frameworks, conducting impact assessments, ensuring diverse and inclusive teams, and fostering public dialogue and collaboration

Real-world applications of AI and ML

AI and ML are transforming various industries by enabling new products, services, and business models
Understanding the real-world applications of AI/ML across different domains helps organizations identify opportunities for digital transformation and innovation

AI/ML in healthcare and life sciences

AI/ML applications in healthcare include medical image analysis (disease detection, segmentation), drug discovery and development, and personalized medicine
ML algorithms can analyze electronic health records (EHRs) to identify patterns, predict patient outcomes, and optimize treatment plans
AI-powered chatbots and virtual assistants can provide patient support, triage, and remote monitoring services
In life sciences, AI/ML is used for genomic data analysis, protein structure prediction, and biomarker discovery

AI/ML in finance and banking

AI/ML applications in finance include fraud detection, credit risk assessment, algorithmic trading, and portfolio optimization
ML models can analyze vast amounts of financial data (transaction records, market data) to identify fraudulent activities and assess the creditworthiness of borrowers
AI-powered chatbots and virtual assistants provide personalized financial advice and customer support
Robo-advisors use ML algorithms to automate investment portfolio management and provide low-cost, accessible investment services

AI/ML in retail and e-commerce

AI/ML applications in retail and e-commerce include personalized product recommendations, dynamic pricing, and demand forecasting
ML algorithms analyze customer data (browsing history, purchase records) to provide targeted product recommendations and optimize pricing strategies
Computer vision and natural language processing techniques enable visual search, product categorization, and sentiment analysis of customer reviews
AI-powered chatbots and virtual assistants provide customer support, handle inquiries, and facilitate online transactions

AI/ML in manufacturing and logistics

AI/ML applications in manufacturing include predictive maintenance, quality control, and production optimization
ML models can analyze sensor data from manufacturing equipment to predict failures, optimize maintenance schedules, and improve overall equipment effectiveness (OEE)
Computer vision techniques enable automated visual inspection and defect detection on production lines
In logistics, AI/ML is used for route optimization, demand forecasting, and inventory management, improving efficiency and reducing costs in supply chain operations

Key Terms to Review (32)

Accuracy: Accuracy refers to the degree to which a result or measurement conforms to the true value or standard. In data analysis and machine learning, accuracy indicates how well a model performs in predicting outcomes correctly, with higher accuracy reflecting better performance. This concept is vital across various domains, as it ensures reliability in decision-making processes driven by data insights.

Activation Functions: Activation functions are mathematical equations used in artificial neural networks to determine whether a neuron should be activated or not based on its input. They introduce non-linearity into the model, allowing it to learn complex patterns and make better predictions. By transforming the output of a neuron, activation functions play a critical role in the learning process of AI and machine learning algorithms.

Algorithmic bias: Algorithmic bias refers to systematic and unfair discrimination that arises from the algorithms used in artificial intelligence (AI) and machine learning (ML) systems. This bias can lead to unequal treatment of individuals based on attributes such as race, gender, or socioeconomic status, often reflecting pre-existing societal inequalities. Understanding algorithmic bias is crucial for developing fair and equitable AI systems and ensuring corporate digital responsibility in deploying these technologies.

Andrew Ng: Andrew Ng is a prominent computer scientist and entrepreneur known for his contributions to artificial intelligence (AI) and machine learning (ML). He co-founded Google Brain, an AI research project at Google, and is a leading figure in online education through platforms like Coursera, making advanced knowledge in AI accessible to a broader audience.

Artificial Intelligence: Artificial intelligence (AI) refers to the simulation of human intelligence in machines programmed to think and learn like humans. AI encompasses a variety of technologies that enable machines to perform tasks that typically require human cognitive functions, such as problem-solving, understanding natural language, and recognizing patterns. This capability not only drives the process of digital transformation but also plays a pivotal role in innovation, reshaping business models, enhancing competitive advantage, and fostering digital skills development.

Classification: Classification is the process of organizing data into predefined categories or classes based on shared characteristics. This concept is central to machine learning and artificial intelligence, as it allows algorithms to make predictions about new data points by identifying which category they belong to based on training data.

Convolutional Neural Networks: Convolutional Neural Networks (CNNs) are a class of deep learning algorithms designed primarily for processing structured grid data like images. They use convolutional layers to automatically and adaptively learn spatial hierarchies of features from input images, enabling them to excel in tasks like image classification and object detection. CNNs play a crucial role in computer vision, leveraging their ability to recognize patterns and features in visual data.

Decision trees: Decision trees are a type of flowchart-like structure that helps in making decisions by mapping out different possible outcomes based on various conditions or attributes. They are widely used in artificial intelligence and machine learning to classify data and make predictions, providing a visual representation that simplifies the understanding of complex decision-making processes. The decision tree splits data into branches based on feature values, leading to outcomes or decisions that can be easily interpreted.

Deep learning: Deep learning is a subset of machine learning that uses neural networks with multiple layers to analyze and learn from large amounts of data. It mimics the human brain's ability to process information, allowing systems to recognize patterns and make predictions with high accuracy. This technique is especially effective in areas such as image and speech recognition, enabling advancements in automation and artificial intelligence.

Feature engineering: Feature engineering is the process of using domain knowledge to extract features from raw data that make machine learning algorithms work more effectively. It involves transforming and selecting the right variables that help improve model performance, allowing algorithms to learn more accurately from the data. This process is essential in AI and ML because the quality and relevance of features directly impact the ability of models to make predictions.

Generative Adversarial Networks: Generative Adversarial Networks (GANs) are a class of machine learning frameworks designed to generate new data instances that resemble a given training dataset. This innovative architecture pits two neural networks against each other: the generator, which creates fake data, and the discriminator, which evaluates its authenticity. The interplay between these networks results in the generator improving its outputs over time, leading to highly realistic data generation.

Geoffrey Hinton: Geoffrey Hinton is a prominent computer scientist known as one of the 'godfathers' of artificial intelligence and deep learning. His groundbreaking work in neural networks has significantly advanced the field of machine learning, particularly through the development of algorithms that allow machines to learn from large datasets. Hinton's contributions have laid the foundation for many modern AI applications, making him a key figure in understanding the fundamentals of AI and ML.

K-nearest neighbors: K-nearest neighbors (KNN) is a simple, yet powerful algorithm used in machine learning for classification and regression tasks. It works by identifying the 'k' closest data points in the training dataset to a given test point and making predictions based on the majority class (for classification) or the average value (for regression) of those neighbors. This method is intuitive and effective, particularly for problems where the relationships between data points are complex and not easily captured by linear models.

Loss functions: Loss functions are mathematical tools used in machine learning and artificial intelligence to quantify how well a model's predictions match the actual outcomes. They serve as a guide for optimizing the performance of models by providing feedback on the errors in predictions, which helps in adjusting parameters during the training process. By minimizing the loss, models can improve their accuracy and reliability when making predictions on new data.

Machine Learning: Machine learning is a subset of artificial intelligence that enables systems to learn from data, identify patterns, and make decisions with minimal human intervention. It plays a crucial role in harnessing data-driven insights for businesses, enhancing decision-making processes, and improving overall operational efficiency.

Model training: Model training is the process of teaching a machine learning algorithm to recognize patterns and make predictions based on data. During this phase, the algorithm learns from a set of training data, adjusting its internal parameters to minimize errors in its predictions. This process is essential for the development of robust AI systems capable of performing tasks such as classification, regression, and more.

Naive Bayes: Naive Bayes is a family of probabilistic algorithms based on applying Bayes' theorem with strong independence assumptions between the features. It’s particularly effective in classification tasks, where it predicts the class of an instance based on the probabilities derived from the training data. The 'naive' aspect comes from the assumption that all features are independent, which simplifies computation and allows it to perform surprisingly well in practice, especially with large datasets.

Neural networks: Neural networks are computational models inspired by the human brain, designed to recognize patterns and solve complex problems through a system of interconnected nodes or 'neurons'. These networks process data in layers, with each layer transforming the input into a more abstract representation, enabling applications such as image recognition, natural language processing, and predictive analytics.

Precision: Precision refers to the degree to which repeated measurements or calculations yield consistent results. It is a crucial aspect of evaluating the quality of data, especially in fields that rely on statistical and algorithmic models, where accuracy may be impacted by variability or noise. In the context of machine learning and predictive analytics, precision measures how many of the correctly predicted positive instances out of all the predicted positive instances were actually relevant.

Predictive Analytics: Predictive analytics is the use of statistical algorithms, machine learning techniques, and historical data to identify the likelihood of future outcomes. This process helps organizations make informed decisions by analyzing trends and patterns to forecast what could happen in the future, influencing strategies and operations across various domains.

PyTorch: PyTorch is an open-source machine learning library widely used for developing deep learning applications. Known for its flexibility and ease of use, PyTorch allows developers to build and train neural networks with dynamic computation graphs, enabling faster experimentation and debugging. This makes it a preferred choice for researchers and developers in artificial intelligence and machine learning fields.

Random Forests: Random forests is an ensemble learning method used for classification and regression that constructs a multitude of decision trees during training time and outputs the class that is the mode of the classes or mean prediction of the individual trees. This technique enhances accuracy and helps prevent overfitting by combining multiple decision trees, thus leveraging the power of diversity in predictions.

Recurrent neural networks: Recurrent neural networks (RNNs) are a class of artificial neural networks designed for processing sequences of data by maintaining a 'memory' of previous inputs through recurrent connections. This ability to retain information about previous inputs makes RNNs particularly well-suited for tasks involving time series data, speech recognition, and natural language processing, where the order and context of information are crucial for understanding meaning.

Regression: Regression is a statistical method used in data analysis and machine learning to understand the relationship between variables. It helps in predicting a continuous outcome based on one or more predictor variables, making it a fundamental technique in both AI and machine learning for tasks like forecasting and trend analysis.

Regularization techniques: Regularization techniques are methods used in machine learning and statistics to prevent overfitting by adding a penalty term to the loss function. By doing so, these techniques help to improve the model's generalization to unseen data, ensuring that it performs well not just on training data but also in real-world applications. Regularization balances the model's complexity and the accuracy of predictions, which is crucial in both AI and predictive modeling contexts.

Semi-supervised learning: Semi-supervised learning is a machine learning approach that combines a small amount of labeled data with a large amount of unlabeled data during training. This method leverages the strengths of both supervised and unsupervised learning, enabling models to achieve better accuracy and generalization. By using the available labeled data to guide the learning process, semi-supervised learning can effectively extract patterns from unlabeled data, making it especially useful in situations where labeling data is expensive or time-consuming.

Supervised learning: Supervised learning is a type of machine learning where an algorithm is trained on labeled data, meaning that the input data is paired with the correct output. This approach allows the algorithm to learn the relationship between inputs and outputs so it can make predictions or classifications on new, unseen data. It's essential in AI because it helps systems improve their accuracy by learning from past examples.

Support Vector Machines: Support Vector Machines (SVMs) are supervised learning models used for classification and regression tasks, aiming to find the optimal hyperplane that best separates data into different classes. By transforming data into higher dimensions, SVMs handle non-linear boundaries effectively, making them powerful tools in machine learning. Their ability to maximize the margin between classes helps improve predictive performance.

Tensorflow: TensorFlow is an open-source machine learning framework developed by Google that enables developers to build and deploy machine learning models. It provides a comprehensive ecosystem for working with deep learning, neural networks, and other advanced AI technologies, allowing for efficient numerical computation and easy scalability across devices.

Training data: Training data refers to a set of examples used to teach machine learning algorithms how to make predictions or decisions. This data is crucial because it helps the algorithm identify patterns, learn from inputs, and improve its performance over time. Quality training data ensures that the model generalizes well to new, unseen data, ultimately enhancing the reliability of AI applications.

Transparency: Transparency refers to the clarity and openness of processes, decisions, and data, enabling stakeholders to understand how actions are taken and outcomes are reached. This concept is vital in ensuring accountability and trust, especially in complex systems like AI, blockchain, and corporate practices where understanding decision-making processes can affect user confidence and ethical standards.

Unsupervised Learning: Unsupervised learning is a type of machine learning where the model is trained on data that has not been labeled or categorized. This approach allows algorithms to identify patterns and structures within the data without any prior knowledge of what the output should be. It plays a crucial role in discovering hidden relationships and features within datasets, making it essential for tasks like clustering, association, and dimensionality reduction.

AP® and SAT® are trademarks registered by the College Board, which is not affiliated with, and does not endorse this website.

Back

Practice QuizGlossary

Practice Quiz Glossary

6.1 AI and ML fundamentals

Defining AI and ML

AI as a broad concept

Top images from around the web for AI as a broad concept

Top images from around the web for AI as a broad concept

Machine learning as a subset of AI

Deep learning and neural networks

Key differences between AI and ML

Comparing rule-based vs learning systems

Exploring supervised vs unsupervised learning

Classification vs regression in ML

Essential components of ML systems

Importance of data in ML

Feature selection and engineering

Training, validation and test datasets

Common ML algorithms and models

Decision trees and random forests

Support vector machines (SVMs)

Naive Bayes classifiers

K-nearest neighbors (KNN)

Neural networks and deep learning

Basics of artificial neural networks

Convolutional neural networks (CNNs)

Recurrent neural networks (RNNs)

Generative adversarial networks (GANs)

ML development process and lifecycle

Problem definition and data collection

Data preprocessing and cleaning

Model selection and hyperparameter tuning

Deployment, monitoring and maintenance

Challenges and limitations of AI/ML

Bias and fairness in ML models

Interpretability and explainability

Handling concept drift and model decay

Ethical considerations in AI/ML

Real-world applications of AI and ML

AI/ML in healthcare and life sciences

AI/ML in finance and banking

AI/ML in retail and e-commerce

AI/ML in manufacturing and logistics

Key Terms to Review (32)

© 2024 Fiveable Inc. All rights reserved.

AP® and SAT® are trademarks registered by the College Board, which is not affiliated with, and does not endorse this website.

Back

Next guide