How AI learns from data

Supervised learning

Supervised learning is the most straightforward type of AI training and often the first one people think of. In supervised learning, the AI model is trained on labeled data—data that includes both the input and the correct output. Each example acts like a teacher, helping the model learn the “right answers” so it can recognize similar patterns in new data.

To understand supervised learning, let’s use an analogy. Imagine you’re teaching a dog to respond to commands like “sit” or “stay.” You show the dog the command (input) and guide it through the action (output), giving it treats when it gets it right. Eventually, the dog learns to associate each command with the correct action. In the same way, a supervised learning model is given examples of inputs and their corresponding outputs.

For instance, if we’re training an AI to recognise photos of cats and dogs, we’d feed it thousands of labeled images, each marked “cat” or “dog.” The model uses these labels to learn what a cat looks like versus a dog. By the end of training, it can identify a cat or dog in new, unlabeled images by applying what it learned from the labeled examples.

Supervised learning is especially useful when we have a large amount of labeled data and a clear goal. Here are some examples:

Email filters

An AI model trained to classify emails as “spam” or “not spam” based on labeled examples of each.

Medical diagnosis

Models that predict diseases by learning from labeled patient records (“disease” or “no disease”).

Image recognition

Identifying objects in images, like animals, cars, or plants, based on labeled training images.
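
To make the idea of learning from labeled examples concrete, here is a minimal sketch in Python. It is not how a real image recogniser works: instead of photos, each animal is described by two made-up numbers (weight in kg and shoulder height in cm), and the "model" is simply the average point for each label. The function names `train` and `predict` and all the data values are assumptions chosen for illustration.

```python
from collections import defaultdict
from math import dist

def train(examples):
    """Learn one average point (centroid) per label from labeled examples."""
    sums = defaultdict(lambda: [0.0, 0.0])
    counts = defaultdict(int)
    for (weight, height), label in examples:
        sums[label][0] += weight
        sums[label][1] += height
        counts[label] += 1
    return {
        label: (total[0] / counts[label], total[1] / counts[label])
        for label, total in sums.items()
    }

def predict(model, features):
    """Return the label whose centroid is closest to the new, unlabeled input."""
    return min(model, key=lambda label: dist(model[label], features))

# Hypothetical labeled data: (weight in kg, shoulder height in cm) -> label
labeled = [
    ((4.0, 25.0), "cat"), ((5.0, 23.0), "cat"), ((3.5, 24.0), "cat"),
    ((20.0, 55.0), "dog"), ((30.0, 60.0), "dog"), ((25.0, 50.0), "dog"),
]

model = train(labeled)
print(predict(model, (4.5, 26.0)))   # cat
print(predict(model, (22.0, 58.0)))  # dog
```

The labels do the teaching here: the model never sees a rule like "dogs are bigger", it only sees examples paired with the correct answer and generalises from them.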

Unsupervised learning

Unsupervised learning is different. It doesn’t rely on labeled data. Instead, the AI is given unlabeled data and tasked with finding patterns or relationships. Imagine you’re given a big box of photos but no information about what’s in them. Your task is to sort them into groups that belong together, maybe by people, places, or time periods. You might start noticing that some photos have similar colours, people, or locations and group them accordingly. You don’t know the exact labels, but you can still organise them based on the patterns you observe.

In unsupervised learning, the AI does something similar. It looks for patterns and similarities within the data to create clusters or groups. For instance, it might look at a large dataset of customer purchases and group similar items together without knowing what each item is. This type of learning is ideal when we want to explore data and discover hidden patterns or relationships. Here are a few common examples:

Customer segmentation

AI models can analyse purchasing patterns to create customer groups with similar buying habits, even without knowing specific customer details.

Anomaly detection

Detecting unusual patterns, such as bank transactions that could indicate fraud.

Data visualisation

Reducing large, complex datasets into simpler visual patterns that humans can easily interpret.
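
The grouping idea can be sketched with a small clustering routine. The example below is a simplified, one-dimensional version of k-means, one common clustering technique, applied to made-up monthly spending figures. No labels are supplied; the code only measures which values sit close together. The function name `kmeans_1d`, the starting positions of the group centres, and the data are all assumptions for illustration.

```python
def kmeans_1d(values, k=2, iterations=10):
    """Group numbers into k clusters by repeatedly assigning each value to
    its nearest centre and moving each centre to the mean of its group.
    Assumes k >= 2."""
    lo, hi = min(values), max(values)
    # Deterministic start: spread the centres evenly between min and max.
    centers = [lo + (hi - lo) * i / (k - 1) for i in range(k)]
    groups = [[] for _ in range(k)]
    for _ in range(iterations):
        groups = [[] for _ in range(k)]
        for v in values:
            nearest = min(range(k), key=lambda i: abs(v - centers[i]))
            groups[nearest].append(v)
        # Keep the old centre if a group ends up empty.
        centers = [
            sum(g) / len(g) if g else centers[i]
            for i, g in enumerate(groups)
        ]
    return centers, groups

# Hypothetical monthly spend per customer; nobody has told us who is a
# "low spender" or a "high spender".
spend = [12, 15, 14, 10, 95, 102, 99, 110]

centers, groups = kmeans_1d(spend, k=2)
print(centers)  # [12.75, 101.5]
print(groups)   # [[12, 15, 14, 10], [95, 102, 99, 110]]
```

The two groups emerge from the data alone. Deciding what to call them ("occasional shoppers", "big spenders") is still a job for a person.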

Reinforcement learning

Reinforcement learning is a unique approach where AI learns through a system of rewards and punishments, much like a game. Rather than learning from labeled data, it learns by interacting with an environment and receiving feedback based on its actions. Over time, it adjusts its strategy to earn more reward.

Picture yourself training a new puppy to fetch a ball. You throw the ball, and every time the puppy retrieves it, you give it a treat (reward). If the puppy ignores the ball, it doesn’t get a treat. Eventually, the puppy learns that fetching the ball leads to treats and starts doing it more often. Reinforcement learning works in the same way. The AI is put in an environment where it can try different actions. Each action is either rewarded or punished based on how close it gets to the desired outcome. Over time, the AI learns which actions lead to rewards, adjusting its strategy to perform better.

One of the key concepts in reinforcement learning is the reward function. This is the feedback system that tells the AI how well it’s doing, allowing it to learn from its successes and mistakes. Reinforcement learning is well-suited for situations where an AI needs to make a series of decisions to achieve a goal, particularly in dynamic environments. Here are some real-world examples:

Game AIs

Models that play games like chess, Go, or even complex video games use reinforcement learning to develop winning strategies.

Robotics

Teaching robots to perform tasks, like stacking boxes or navigating around obstacles, where each action affects the next.

Self-driving cars

Cars learn to make safe driving decisions by receiving rewards for correct actions, like stopping at a red light, and punishments for mistakes.
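
The loop of acting, receiving a reward, and adjusting can be sketched with Q-learning, one well-known reinforcement learning method. The environment below is a toy invented for this example: a corridor of five positions where the agent starts at the left end and is rewarded for reaching the right end. The `step` function plays the role of the reward function described above. The reward values, the learning settings, and all names are assumptions for illustration.

```python
import random

N_STATES = 5          # corridor positions 0..4; position 4 is the goal
ACTIONS = (-1, +1)    # index 0 = step left, index 1 = step right

def step(state, action):
    """Toy environment: move, then return (next_state, reward, done)."""
    nxt = min(max(state + action, 0), N_STATES - 1)
    if nxt == N_STATES - 1:
        return nxt, 1.0, True    # reward for reaching the goal
    return nxt, -0.01, False     # small punishment for every extra step

def train(episodes=500, alpha=0.5, gamma=0.9, epsilon=0.2, seed=0):
    """Learn a table q[state][action] of expected long-term reward."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state = 0
        for _t in range(100):  # cap the length of one episode
            if rng.random() < epsilon:
                a = rng.randrange(2)  # explore: try a random action
            else:
                a = max(range(2), key=lambda i: q[state][i])  # exploit
            nxt, reward, done = step(state, ACTIONS[a])
            # Nudge the estimate towards reward plus discounted future value.
            target = reward + (0.0 if done else gamma * max(q[nxt]))
            q[state][a] += alpha * (target - q[state][a])
            state = nxt
            if done:
                break
    return q

q = train()
# Read off the preferred action in each non-goal position.
policy = ["left" if row[0] > row[1] else "right" for row in q[:-1]]
print(policy)
```

Nobody tells the agent that moving right is correct. It discovers this because moving right eventually leads to the reward, while wandering costs a small penalty each step.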

Bringing it all together

Choosing the right learning type depends on the problem. If you have clear, labeled data, supervised learning is often the best choice. If you’re exploring new data without clear answers, unsupervised learning helps uncover patterns. And if the task involves a series of decisions leading to an end goal, reinforcement learning might be the way to go.

Imagine you’re using a virtual assistant, like Alexa or Siri, and you ask it to play a song you love. Supervised learning helps the assistant recognise your voice and understand the words you’re saying. It has been trained on thousands of labeled audio samples to convert your speech into text. Unsupervised learning helps it recommend similar songs or group playlists, even without explicit information about your preferences. Reinforcement learning guides the assistant to improve over time, as it learns from user feedback (for example, you keep skipping certain songs) to offer better recommendations.

In practice, AI learning is a mix of techniques, all working to create smart, adaptable systems that get better over time. Understanding these methods not only helps us appreciate the complexity behind AI but also gives us insight into its limitations. Each learning type has its strengths, and together, they allow AI to handle a wide variety of tasks and challenges in our data-filled world.
