November 15, 2021

Anomaly Detection with the k-NN Model

Vinay Senthil

Finding a defective item amongst thousands of non-defective objects is an essential task across industries. For example, identifying a dented can on a production line manufacturing thousands of cans per minute or spotting a defective solar panel in a grid stretching miles are both crucial tasks in their respective industries. People often refer to this as “anomaly detection,” as you’re looking for something that’s not the way it should be.

Traditionally, this form of repetitive visual inspection has been time-consuming, demanding hours of human labor to process trillions of pixels. However, organizations have unlocked real business value and freed human creativity by integrating cutting-edge computer vision to augment human sight and supercharge anomaly detection workflows. In some cases, AI has improved manufacturing defect detection rates by 90% and can identify problems in a CT scan up to 150 times faster than a physician could. In fact, CrowdAI receives numerous requests from clients who want to deploy computer vision in their anomaly detection workflows.

Why anomaly detection is a unique computer vision problem

In anomaly detection, a learning algorithm studies a set of example media called the training set. Then, based on what it sees, the algorithm determines if new images are “normal” or “anomalous.” This process is known as supervised learning, since we’re the ones flagging examples of anomalies for the algorithm to learn. However, supervised learning will only take you so far. First, anomalies can take a virtually infinite number of forms, making it nearly impossible to teach a model to recognize every single irregularity that can occur in the real world. Second, a problem called class imbalance may occur: we have too many samples of media depicting “normal” objects and not enough examples of “anomalous” objects, usually because anomalies are rare in practice.

At CrowdAI, our research team is experimenting with innovative model architectures that leverage unsupervised learning, which doesn’t require humans to label examples of anomalies in order for the model to learn. With our approach, we can form an understanding of what “normal” objects look like, then train a model that considers anything outside that description to be anomalous.

Warning: we’re moving into some technical territory!

Threshold Distancing and Anomaly Detection

One unsupervised approach uses a k-Nearest Neighbors (k-NN) model, a distance-based thresholding technique, to determine if an object is anomalous or not. Here’s an overview of how that might work.

Imagine we want to find faulty boxes on a production line. We would start by asking ourselves the question, “what makes a box a box?” Specifically, what features are so essential to the concept of a box that we’d only call a given object a box if they were present?

From individual features to a feature space

We answer this question by defining these essential features of a box. We then turn the range of possibilities of a particular feature (such as height) into coordinates that can be plotted on a graph. The result is what machine learning engineers call a feature space, an abstract representation of the original images on a graph. The closer two dots are to each other in the feature space, the more similar the features of those images (e.g. heights) are, indicating that the original boxes are similar. Distance-based thresholding uses this type of understanding of an object and its features to establish a standard of what is considered a “normal” object or box.
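To make the idea of "closeness" concrete, here is a minimal Python sketch. It assumes each box is described by just two features, height and width in inches, and that similarity is measured with plain Euclidean distance. The sample boxes and the function name `feature_distance` are made up for illustration; a real model would typically use many more features.

```python
import math

# Hypothetical feature vectors: (height, width) in inches for three boxes.
box_a = (5.0, 7.0)
box_b = (5.2, 7.1)  # nearly the same size as box_a
box_c = (8.0, 9.0)  # much larger than box_a


def feature_distance(p, q):
    """Euclidean distance between two points in the feature space."""
    return math.dist(p, q)


# A smaller distance means the two boxes have more similar features.
print("a vs b:", feature_distance(box_a, box_b))
print("a vs c:", feature_distance(box_a, box_c))
```

In this sketch, box_a and box_b sit close together in the feature space, while box_c lands far from both.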

Remember, if we plot the features of all “normal” boxes, we will see a tightly packed cluster of points in the feature space; they are near one another because they are similar. We can then classify any box whose features fall at least a certain distance away from this cluster as “anomalous”.
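One way to measure "distance away from the cluster" with a k-NN flavor is to score a new box by its average distance to its k nearest "normal" neighbors. The sketch below is an illustration under assumptions, not CrowdAI's implementation: the function names, the sample data, k = 3, and the threshold of 1.0 are all hypothetical choices.

```python
import math


def knn_score(point, normal_points, k=3):
    """Mean distance from `point` to its k nearest neighbors among `normal_points`."""
    if k < 1 or k > len(normal_points):
        raise ValueError("k must be between 1 and the number of normal points")
    distances = sorted(math.dist(point, p) for p in normal_points)
    return sum(distances[:k]) / k


def is_anomalous(point, normal_points, k=3, threshold=1.0):
    """Flag a point whose k-NN score exceeds the (assumed) threshold."""
    return knn_score(point, normal_points, k) > threshold


# Hypothetical (height, width) measurements of boxes known to be normal.
normal_boxes = [(5.0, 7.0), (5.1, 6.9), (4.9, 7.1), (5.2, 7.0), (4.8, 6.8)]

print(is_anomalous((5.0, 7.05), normal_boxes))  # lies inside the cluster
print(is_anomalous((8.0, 9.0), normal_boxes))   # lies far from the cluster
```

Note that no anomalous examples are needed here: the score is computed only against boxes known to be normal, which is what makes the approach unsupervised in the sense described earlier.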

For example, the average height and width of the boxes in our data set could be 5 inches and 7 inches, respectively. We can then make a rule (or threshold) that any box that is more than one inch away from these averages can be considered anomalous. With this threshold, we can now classify a box with features (x, y), where x is its height and y is its width. A box with (5.5, 7) is classified as “normal”, since it is within the one-inch threshold, whereas a box with (8, 9) would be deemed anomalous. Determining this tolerated range is the foundational concept behind distance-based thresholding and anomaly detection.
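The worked example above can be sketched in a few lines of Python. This sketch assumes that "more than one inch away from these averages" means the straight-line (Euclidean) distance from the average point (5, 7); other readings, such as checking each feature separately, are equally plausible. The names `MEAN_BOX`, `THRESHOLD_INCHES`, and `classify` are illustrative only.

```python
import math

MEAN_BOX = (5.0, 7.0)    # average (height, width) in inches, from the example
THRESHOLD_INCHES = 1.0   # tolerated distance from the average


def classify(box):
    """Label a (height, width) box by its distance from the average box."""
    distance = math.dist(box, MEAN_BOX)
    return "anomalous" if distance > THRESHOLD_INCHES else "normal"


print(classify((5.5, 7.0)))  # 0.5 inches from the average
print(classify((8.0, 9.0)))  # about 3.6 inches from the average
```

The first box falls inside the one-inch tolerance and the second falls well outside it, matching the classifications described in the text.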

Below is an example of a workflow for anomaly detection in the CrowdAI Platform.

At CrowdAI, we believe that anomaly detection in combination with unsupervised learning offers many advantages: enterprise agility by reducing the need for labeled images, shorter model training times, and minimized modifications to model architecture. We’re continuing to explore the uses for various anomaly detection models alongside several other techniques to help our clients quickly realize improved business outcomes.

For more deep-dives into anomaly detection and other exciting research in computer vision, follow our LinkedIn page, subscribe to our mailing list, and continue to read the “CrowdAI Research blog series”.

May 22, 2023

“Small Devices, Big Impacts: Streaming Computer Vision Models at the Edge”

Running a computer vision model on a cell phone or mobile device enables real-time analysis of images and videos, which can be useful in a variety of applications. While there are challenges to streaming computer vision models on small devices, CrowdAI has developed a roadmap of techniques and tools to overcome these challenges. By leveraging cloud-driven API connections for invoking inference from a trained model, CrowdAI sees a pathway to real-time analysis of imagery and video on small devices operating at the edge. Additionally, the geospatial benefits of building models from media captured on cell phones can offer unique advantages for training, monitoring, and analyzing objects of interest.

Zeke Foppa and Taylor Maggos

May 8, 2023

Deploy Anywhere; Use Every Camera: The Power of the CrowdAI Platform

In today's world, where we are surrounded by computers and cameras of all types and sizes, it's essential for machine learning services to be deployment-agnostic and camera-agnostic. Being able to work in any cloud, hardware, or software environment, and to use any camera or sensor, is an invaluable advantage that has become increasingly important in recent years as the use of cameras has exploded in various industries. These features allow for greater flexibility and ease of use, which is exactly what CrowdAI strives to provide, enabling ML to be used in a wider range of applications.

Patrick Collins and Taylor Maggos

May 1, 2023

Exploring how SAM and GroundingDino Increase Opportunities to Accelerate Semi- and Fully Automated Bounding Box Data Labeling

Going from a complex segmentation model to a simpler bounding box object detection model using SAM may seem like a bit of overkill, but there are some instances where an object detection model is favored over a segmentation model. For example, if we have a photo of a street with a bunch of pedestrians, a detection model can provide insight into how many people are there, their location in the frame, and how they interact with each other; segmentation masks wouldn’t give us as useful information since they would just be silhouettes of standing or walking people. Another benefit is that object detection models are designed to be more robust to variations in object size, rotation, and aspect ratio, making them ideal for identifying objects with diverse geometries. Lastly, when computational resources are limited, object detection models tend to be less computationally intensive than segmentation models, which can require more processing power and memory to run efficiently.

Zeke Foppa and Taylor Maggos