November 7, 2025
If AI is so smart, why can’t it tell me how much I can trust its answers? Well, maybe it can… We talked about creating “Golden Datasets” – benchmarks that help us evaluate an answer from an AI model. (If… Continue Reading…
November 7, 2025
If AI is so smart, why can’t it tell me how much I can trust its answers? Well, maybe it can… We talked about creating “Golden Datasets” – benchmarks that help us evaluate an answer from an AI model. (If… Continue Reading…
November 7, 2025
Ahoy, mateys! In our last webinar (recording is coming soon), “AI Testing: Beyond the Basics” we did something amazing: we built a functional “AI Testing Kayak.” We created a frameworkf for testing AI products, that’s fast, practical, and, most importantly,… Continue Reading…
October 16, 2025
Let’s talk about AI responses. You ask your genie a question, you get an answer. But is it a good answer? How do we evaluate a response that isn’t deterministic, isn’t repeatable, and might have some nuance? LLM answers are… Continue Reading…
October 28, 2025
Let’s talk about that test. The one that’s always flaky. The one that takes twenty minutes to run and fails for a different reason every time. Your first instinct is to blame the test. Maybe the locator is wrong, maybe… Continue Reading…
November 7, 2025
Let’s talk about the new math of the AI revolution. The promise is simple and seductive: your developers can now deliver five times faster. Your CEO loves it. Your product managers are ecstatic. The whole company is high on the… Continue Reading…
September 30, 2025
Let’s talk about our new AI assistants (soon to be overlords). After all, AI in software testing is all about help. Everyone’s excited about the promise: they can boost our productivity, generate tests, and free us to do the real… Continue Reading…
September 18, 2025
Testing AI applications can feel unpredictable. You might run the same prompt twice and get two different, valid answers, which raises the question: how do you build reliable automated tests for that? AI app testing is a modern challenge. If… Continue Reading…
September 16, 2025
One of the questions I always get asked for recommendations on, is moving from one framework to another—or even one language to another. When you’re already invested in some infrastructure, moving to a new one is, let’s say, risky. Not… Continue Reading…
September 9, 2025
You know what the T in GPT stands for? Transformation. And what is refactoring if not transformation? Refactoring is changing the code, without changing the functionality. And I don’t know if you’ve tried AI code refacotring yet, but it’s a… Continue Reading…
September 2, 2025
The cool thing about testing is the learning. In the world of API test automation, some of us are excited to learn that the system actually does what it needs to. Some of us are excited when we find a… Continue Reading…