How to Run Spark 3 Glue Jobs Locally With Docker?

Zsombor Földesi

AWS Glue development requires a developer endpoint to be running at all times. (Technically it only has to run when jobs are launched; however, stopping the endpoint is not possible, and killing and re-creating it requires config changes, which is a major hassle.) For smaller teams and for small or hobby projects, it makes a lot of sense to develop and run Glue jobs locally, independently of AWS. This is possible with Dockerized Spark, but AWS provides only limited support.


Although Spark 3 came out in early June 2020, AWS currently (as of October 2021) only provides a Docker image with Spark 2.4. Fortunately, the Spark 3.1 engine is already available in the cloud via the AWS console.

Whether it is worth switching from Spark 2 to 3 is beyond the scope of this post; I just want to show you how you can run Glue jobs locally with the Spark 3.1 engine.

Note that this solution is based on alruen's attempt, which didn't quite work for our purposes. (Still, we'd like to give them a thumbs up from here as well!)

Without further ado: you can try our pre-built image (hiflylabs/local-aws-glue-v3-zeppelin) from here, or here you will find everything you need to know for a local build or customization.

Beyond Docker, you need the AWS Command Line Interface (AWS CLI). After installing it, you must authenticate with the aws configure command.
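The command prompts for your credentials interactively; a minimal sketch of the exchange (the region and output format below are just example values, use your own):

aws configure
AWS Access Key ID [None]: <your access key>
AWS Secret Access Key [None]: <your secret key>
Default region name [None]: eu-central-1
Default output format [None]: json

This writes the ~/.aws/credentials and ~/.aws/config files, which we will mount into the container in the next step.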

You can then start the container with the following command:

docker run -it --rm --name local-glue \
  -p 8080:8080 -p 9001:9001 \
  -v $PWD/logs:/logs -v $PWD/notebook:/notebook \
  -e ZEPPELIN_LOG_DIR='/logs' -e ZEPPELIN_NOTEBOOK_DIR='/notebook' \
  -v ~/.aws:/root/.aws:ro \
  hiflylabs/local-aws-glue-v3-zeppelin:v1

Note that the ~/.aws:/root/.aws:ro mount maps the default location of the AWS configuration files (the ones created by the aws configure command) into the container, read-only.
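When you are finished, you can stop the container from another terminal; thanks to the --rm flag above, it is removed automatically once it stops:

docker stop local-glue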

Once the container has started, you can start coding in the Zeppelin notebook, which you can access at http://127.0.0.1:9001, or you can attach to the running container from VS Code. Although that is not part of this tutorial, you can read more about it here.
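As a quick smoke test, you can paste the following into a notebook paragraph. This is a minimal sketch: it assumes the image ships the awsglue Python libraries and that Zeppelin's %pyspark interpreter provides the usual pre-defined sc SparkContext.

%pyspark
from awsglue.context import GlueContext

# Wrap the Zeppelin-provided SparkContext in a GlueContext
glueContext = GlueContext(sc)

# Should print a 3.1.x version if the Spark 3 engine is running
print(sc.version)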

If all went well, you can now develop AWS Glue jobs locally on your own machine with Spark 3; you need neither the AWS console nor a developer endpoint.

Zsombor Földesi - Data Engineer

You can find our other blog posts here.

