
AI Agent Governance: What to Prepare for as AI Enters Your Stack
AI agents are here, and they’re not just suggesting actions. They are taking actions as well, on their own! The question is: who’s governing them?
E-commerce platforms struggle with mismatched product titles and images. We built a solution for a Kaggle challenge that uses BigQuery and Gemini to act as an automated catalog auditor, classifying millions of listings for consistency and providing actionable insights to improve data quality.
We built an automated product catalog auditor using BigQuery and Gemini to solve a common e-commerce problem: inconsistencies between product descriptions and images. Our solution, developed for a hackathon, can classify millions of listings, identify the root causes of errors, and provide actionable insights to improve data quality at scale. This project was our answer to the BigQuery AI - Building the Future of Data hackathon featured by Google Cloud on Kaggle.
In this article, we will demonstrate how to use BigQuery's built-in AI capabilities to process mixed-format data and tackle real-world business challenges directly within the database.
You've probably experienced it before: you search for a "blue cotton t-shirt," click on a promising result, and the image shows a red polyester jacket. This mismatch between a product's title and its image is a huge headache for e-commerce platforms. It leads to customer confusion, erodes trust, and drives traffic away.
For companies with millions of products, manually reviewing every listing is just impossible. This is where AI comes in. The hackathon prompted us to use BigQuery's AI features to solve a real problem with unstructured data, and this common problem of title-image inconsistency was a perfect fit.
We developed a solution that transforms BigQuery into a powerful, automated data quality engine. The core of our project is a single SQL query that uses the AI.GENERATE function to pass product titles and image URLs to the gemini-2.5-flash model.
We instructed the model to act as a strict catalog auditor and classify each product into one of four categories:
Critically, we also asked the model for explainability:
(You can find code and prompts for this in the notebook linked at the end of the article.)
After running the analysis, we uncovered several key insights into the catalog's data quality.
The audit revealed a nearly even split between consistent (OK) and inconsistent (MISMATCH + ERROR) listings. This confirmed our hypothesis that catalog quality was a significant issue, driven by incomplete titles (missing brand or volume) and noisy product photos.
The vast majority of classifications had a confidence score between 0.9 and 1.0. This high level of certainty suggests we can largely trust the model's automated judgments for most cases, while the UNCERTAIN category effectively isolates the few ambiguous listings that require human review.
By analyzing the keywords in the model's "reasons," we found the primary drivers of mismatches. "Brand" was mentioned over 15,000 times, with "volume" and "type" being the next most common culprits. This tells us that the most significant source of catalog error isn't wildly incorrect listings, but rather missing or conflicting details about specific attributes.
To see the auditor in action, let's look at a few examples.
This solution provides a scalable, data-driven framework for e-commerce catalog management. Retailers can use these AI-generated insights to:
By embedding AI directly within the database, we've shown that BigQuery can be more than just a data warehouse, and that it can be an active, intelligent engine for ensuring data quality and solving real-world business problems.
We embraced the Multimodal Pioneer approach from the hackathon by combining text and images to tackle a real-world business problem directly within BigQuery and demonstrated that you don't need a separate system to process mixed-format data. It can all happen within a single environment that feels like an extension of SQL, and you can realize a powerful, scalable, end-to-end AI workflow that turns overlooked data into actionable insights.
If you’d like to dig into the details, find the notebook of this solution under this link.