Gold

One Platform, Endless Possibilities: Introducing BrowserStack Test Platform 🚀

May 26, 2025 by Aishling Warde

Software testing has evolved. Engineering teams today are navigating an increasingly complex landscape—tight release cycles, growing test coverage demands, and the rapid adoption of AI in testing. But fragmented toolchains and inefficiencies slow teams down, making it harder to meet quality expectations at speed.

We believe there’s a better way.

Today, we’re thrilled to introduce the BrowserStack Test Platform—an open, integrated and flexible platform featuring AI-powered testing workflows that enable users to simplify their toolchain into a single platform, eliminating fragmentation, reducing costs, and improving productivity. Built to enhance efficiency, the Test Platform transforms how teams approach quality, delivering up to 50% productivity gains while expanding test coverage.

The Challenge: Fragmentation Meets AI

Traditionally, QA teams have had to juggle disconnected tools for test automation, device coverage, visual regression, performance analysis, accessibility compliance, and more. The result? Fractured workflows, hidden costs, and a lot of context switching.

We wanted to change that. Our goal was to bring every aspect of testing—across web, mobile, and beyond—under one roof, complete with AI-driven intelligence, detailed analytics, and robust security features. By unifying the testing process, teams can dramatically improve productivity, reduce costs, and focus on delivering what truly matters: stellar digital experiences.

Introducing BrowserStack Test Platform

1. Faster Test Cycles with Test Automation

Enterprise-grade infrastructure for browser and mobile app testing—run tests in the BrowserStack cloud or self-host on your preferred cloud provider. This helps improve automation scale, speed, reliability, and efficiency.
AI-driven test analysis, test orchestration, and self-healing to pinpoint and fix issues faster.
Designed to maximize the ROI of test automation, freeing you to focus on innovative work instead of manual maintenance.

2. BrowserStack AI Agents

The platform’s AI Agents transform every aspect of the testing lifecycle, from planning to validation.
With a unified data store, AI Agents gain rich context, helping teams achieve greater testing accuracy and efficiency.
Automate repetitive tasks, identify flaky tests, and optimize testing workflows seamlessly.

3. Comprehensive Test Coverage

20,000+ real devices and 3,500+ browser-desktop combinations to replicate actual user conditions.
Advanced accessibility testing ensures compliance with ADA & WCAG standards.
Visual testing powered by the BrowserStack Visual AI Engine to spot even minor UI discrepancies.

4. Test & Quality Insights

A single-pane executive view for all your QA metrics, integrated into the Test Platform.
Test Observability and AI-powered Test Management streamline debugging and analytics.
Data-driven insights to help teams make informed decisions and continuously refine their testing strategies.

5. Open & Flexible Ecosystem

Uniform workflows and a consistent user experience reduce context switching.
100+ integrations for CI/CD, project management, and popular automation frameworks, letting you plug and play with your existing toolchain.
Built for any tech stack, any team size, and any testing objective—no matter how unique.

Built for Developers, by Developers

Our team of 500+ developers has poured their expertise into building a platform that eliminates friction from the testing process. From zero-code integration via our SDK to enterprise-grade security, private network testing, and unified test monitoring—every feature has been designed with one goal in mind: making testing seamless.

The Future of Testing Starts Here

The BrowserStack Test Platform is more than just a product launch—it’s a paradigm shift in how engineering teams think about software quality. Whether you’re a developer, tester, or QA leader, this platform is designed to help you build the test stack your team wants.

Ready to transform your testing workflows? Explore the BrowserStack Test Platform.

Author

Kriti Jain – Product Growth Leader

Kriti is a product growth leader at BrowserStack and focuses on central strategic initiatives, particularly AI. She has over ten years of experience leading strategy and growth functions across diverse industries and products.

BrowserStack are Gold Sponsors in this years’ EuroSTAR Conference EXPO. Join us in Edinburgh 3-6 June 2025.

The Evolution of AI in Software Testing: From Machine Learning to Agentic AI

April 9, 2025 by Aishling Warde

Everywhere you turn, someone is talking about AI — AI this, AI that. No wonder some people roll their eyes at the mention of artificial intelligence. For some, it’s all smoke and mirrors, just a glorified spreadsheet rather than a technological breakthrough capable of real cognitive reasoning.

And just when you think you’ve caught up, something new appears. First, we had simple machine learning and AI, then came Generative AI, and now Agentic AI is all the rage. If you feel like you’re constantly playing catch-up, you’re not alone.

But whether you love it or loathe it, AI isn’t going anywhere. In fact, some tools are now designed to think, create, and learn—just like Keysight’s Eggplant Intelligence.

The Thinking, Creating, and Learning Framework

This framework simplifies AI by breaking it into three key functions:

Thinking involves decision-making and adaptability, much like Agentic AI, which enables AI to make choices based on real-time data.
Creating is tied to generative AI capabilities, allowing AI to generate test cases and user scenarios autonomously.
Learning follows the principles of traditional machine learning, pioneered by Alan Turing in 1950, and enables AI to improve over time based on historical data.

Figure 1: Eggplant Intelligence supports the entire Quality Engineering Lifecycle

So, what’s the real difference between these AI types? How do they impact software testing? And does anyone actually care? The short answer: there are plenty of differences, they have a huge impact, and yes, you should care.

Before we unravel these questions, let’s take a trip down memory lane to understand how we got here.

The Birth of AI in Software Testing – Keysight Eggplant’s Heritage

Back in 1947, Alan Turing gave a lecture that introduced the idea of a machine’s ability to exhibit intelligent behaviour and learn just like a human. Since then, ‘machine learning’ and artificial intelligence has evolved considerably, and in 2018, Keysight Eggplant integrated such tools into its Digital Automation Intelligence (DAI) platform, which is now known as Eggplant Test. This was groundbreaking then and remains so today, enabling automated software testing to:

Identify all user journeys – Machine learning algorithms analyze applications and uncover every possible user journey to generate test cases automatically, improving test coverage and reducing manual effort.
Prioritize test cases – By learning from historical test runs and code changes, the system can pinpoint high-risk areas and prioritize testing where it matters most, optimizing testing time and resources.
Detect anomalies – AI can track normal system behavior, spot deviations, and flag potential defects early in the development cycle.
Adapt test scripts – Automated scripts dynamically adjust to application changes, minimizing maintenance and improving long-term test stability.

This goes beyond simple test automation. Imagine changing your payment gateway on an eCommerce site—Eggplant can auto-generate new test cases to reflect the update without requiring hours of script rewrites. That’s the power of intelligent automation.

But AI in software testing isn’t just about running test cases. Keysight Eggplant Test has also led the way in image-based testing, optical character recognition (OCR), and computer vision—critical for automating graphical user interface (GUI) testing in complex, secure environments.

Generative AI – Automating Test Creation

Next up: Generative AI, the “Creating” part of the framework. This subset of AI revolves around understanding and generating human-like language through natural language processing (NLP), including large language models (LLMs).

Generative AI can be used to automate test cases, reducing manual effort while improving accuracy. But Keysight is taking it a step further—our Gen AI capabilities are in development to generate test case frameworks directly from software requirements documentation, allowing testers to refine them rather than start from scratch once launched.

Security is also a major priority, which is why when Eggplant Test with Gen AI is launched it will operate using secure, offline, technology-agnostic LLMs. Unlike cloud-based solutions, our models will be deployed on-premises, ensuring complete control over sensitive data and compliance with strict security regulations.

Cloud-based AI testing tools that use ChatGPT pose risks, such as “shadow prompting,” where unchecked user inputs generate unreliable outputs. While techniques like prompt engineering can mitigate this, on-premises AI solutions eliminate the risk altogether.

Agentic AI – The Next Evolution

Now, we arrive at Agentic AI, the “Thinking” part of our framework. This evolution introduces intelligent agents that can autonomously design, execute, and optimize test cases. Using chain of thought, a technique that stacks multiple commands to perform complex tasks, these agents perform intricate testing, ensuring all possible user interactions and edge cases are covered.

Another breakthrough is computer use agents (CUA) such as large action models (LAMs), which automate browser-based processes by interacting with web applications just like human testers. This is crucial for end-to-end web testing across various devices and browsers.

And then there’s large vision models (LLaVA), which enhance technologies like traditional computer vision to interpret and validate visual data, verifying UI elements and graphical components in applications.

Sound familiar? It should. Eggplant Intelligence already integrates elements of AI, Gen AI, and Agentic AI into a single platform. Our system optimizes test coverage, automates interactions across digital environments, and executes tests just as a human would, all while remaining offline and compliant with AI governance laws in the UK, EU, and US.

AI Testing Compliance – The Keysight Advantage

Many testing tools rely on cloud-based AI architectures, making them non-compliant with the EU AI Act and other regulatory frameworks. Cloud-based solutions often fail to meet the strict security demands of regulated industries, leaving organizations exposed to potential privacy violations.

For industries like aerospace, defense, and healthcare—where data security is non-negotiable—cloud-based AI testing tools are simply not an option. Storing customer or intellectual property data outside a secure firewall can lead to legal consequences and hefty fines.

This is why Keysight Eggplant is the only AI-powered testing solution that prioritizes security, transparency, and governance. Our on-premises approach ensures that all sensitive data remains secure, meeting even the most stringent compliance requirements.

And let’s be clear—using cloud-based AI for test script generation or test reports is not only risky but illegal in many jurisdictions. GDPR and other data protection laws prohibit storing customer data outside of an organization’s firewall, making cloud AI tools a liability for compliance-conscious businesses.

The Future of AI in Software Testing

AI in testing isn’t just about keeping up with the latest buzzwords. It’s about making smart, future-proof choices that balance innovation with security, scalability, and compliance.

Keysight Eggplant has been pioneering AI-driven testing since 2017, long before many of today’s players entered the field. As AI evolves, we continue to push boundaries, ensuring our platform remains at the cutting edge of secure, offline AI testing.

So, if you’re serious about automated software testing and need a future-proof, AI-driven platform that doesn’t compromise security, compliance, or flexibility—it’s time to take a closer look at Keysight Eggplant.

Header image is a photo by Mauro Sbicego on Unsplash.

Author

Mike Wager

Product Marketing Manager at Keysight Technologies

Keysight are Gold Partners in this years’ EuroSTAR Conference EXPO. Join us in Edinburgh 3-6 June 2025.

AI: competitor or assistant?

March 26, 2025 by Aishling Warde

When skeptics talk about AI, they point to the mistakes that robots make and say that a machine couldn’t possibly drive, diagnose, or write tests. But can we humans really do that? Look at all the mistakes we’re making all the time! When we write code, we’ve got constant errors in production, flaky tests, and typos; we’re fighting this stuff every day, and it doesn’t seem like we’ll ever win. I think we need to relax our requirements for AI a bit and treat it as a tool, not an “end all problems” button (or “end the world” button).

Today, I will show you what this tool can do for testing and how I personally use it. Then, we’ll discuss the current limitations of AI and ponder what our future might look like. Let’s get into it!

How I use AI

An alternative to Google

Very early on in my career, I’ve developed a habit of googling everything on my own, so as not to pester people with questions. Now, I’ve lost that habit altogether. When I’ve got a problem that I don’t know how to solve, I just ask ChatGPT.

Here’s how it works. Say we’ve got a burning question:

“Please explain to me what an SQL vulnerability is”

It gives a basic explanation:

The thing is, I’m not really looking for an encyclopedia entry. I want to solve a specific problem. So I ask it:

“How would I test a login form for SQL injection vulnerability?”

And so on and so forth, it goes on for a while. It’s enough to write a few good tests for a page. And I like that he’s stressing the ethical aspect. But all of this gets me thinking of another vulnerability:

“Could I insert an SQL injection into a URL?”

Again, this is enough for me to write a test. I’ve learned something useful today! And this is why I no longer use Google.

So, here’s what you’re getting with this tool:

An endless repository of knowledge on every topic imaginable.
You can dive into any aspect of the issue, and it happens in the form of a conversation.
The result of your conversation is a practical solution to a problem.

Fixing a bash script

The next one is a real story. I wanted to figure out what’s going on inside this script:


set -e

# Set `java` command if needed
if [ "$1" = "" -o "${1:0:1}" = "-" ]; then
  set -- java $JAVA_OPTS -cp /var/wiremock/lib/*:/var/wiremock/extensions/* wiremock.Run "$@"
fi

# allow the container to be started with `-e uid=`

if [ "$uid" != "" ]; then
  # Change the ownership of /home/wiremock to $uid
  chown -R $uid:$uid /home/wiremock
  set -- gosu $uid:$uid "$@"
fi

exec "$@" $WIREMOCK_OPTIONS

I’ve worked with this code once but couldn’t remember anything. If you can figure out what every line does straight away – good for you! You’re one of the few who can. But I decided to ask ChatGPT: “What’s going on in this script?”

Here’s what it told me:

Turns out – it can dig into each line and explain what is happening. It is being specific and to the point. It even tells you about external stuff like ‘gosu’.

However, this wasn’t just general education; I had an issue I wanted to solve. So my next question was:

“How would I change the user if this script is run in a Docker container?”

Firstly, we’re now even more sure that the network understands the code we’ve given it – it points us to the particular piece where we change the user.

Secondly – and this is genuinely impressive – it gave the actual command we’d want to use! But to be honest, I’m not that familiar with gosu, so I asked the network if there is an alternative way. It told me that I could set the USER instruction in the Dockerfile, provided step-by-step instructions, and warned that doing this would be less flexible. This is true, but I still chose that solution because it’s simpler.

So, what conclusions can we draw from the conversation?

This thing understands all languages. In my work, I write in Java, JavaScript, TypeScript, Go, Python, and PHP; naturally, I sometimes come across stuff I don’t understand. ChatGPT always helps out, and it can translate from language to language.
It really does understand semantics! It can dig into each line and explain what it does.
At the end of your chat, you have a working solution to your problem.

Generating data

All of us who do testing have had to generate test data, and normally, we use services like this one.

How well can ChatGPT handle this task?

“Please generate a list of 10 capitals on different continents in JSON format:”

And so forth. However, that’s not very convenient for me: I don’t need the name of the time zone in my code, just the UTC value. Also, as an end result, I need a test; JSON is just the intermediary format. So the next request is:

“Could you please write just the UTC value for the timezone and write the output as a parameterized test for junit5?”

Here is the test it wrote:

import org.junit.jupiter.api.Test;
import org.junit.jupiter.params.ParameterizedTest;
import org.junit.jupiter.params.provider.Arguments;
import org.junit.jupiter.params.provider.MethodSource;

import java.util.stream.Stream;

import static org.junit.jupiter.api.Assertions.assertEquals;

class CapitalsTest {

    @ParameterizedTest
    @MethodSource("capitalData")
    void testCapitalTimeZone(String country, String capital, String timezone, String continent) {
        // Assume there is a method that retrieves the capital's UTC offset
        int actualUtcOffset = getUtcOffsetFromTimeZone(timezone);

        // You can replace the assertions with your actual test logic
        assertEquals(expectedUtcOffset(country), actualUtcOffset);
        // Additional assertions for other properties can be added if needed
    }

    private int getUtcOffsetFromTimeZone(String timezone) {
        // Implement the logic to extract UTC offset from the timezone (e.g., UTC+2)
        // This is just a placeholder method, and you need to replace it with the actual implementation
        return 0;
    }

    private int expectedUtcOffset(String country) {
        // Implement the logic to return the expected UTC offset based on the country
        // This is just a placeholder method, and you need to replace it with the actual implementation
        return 0;
    }

    private static Stream<Arguments> capitalData() {
        return Stream.of(
                Arguments.of("United States", "Washington, D.C.", "UTC-5", "North America"),
                Arguments.of("Brazil", "Brasília", "UTC-3", "South America"),
                Arguments.of("United Kingdom", "London", "UTC+0", "Europe"),
                Arguments.of("South Africa", "Pretoria (administrative), Cape Town (legislative), Bloemfontein (judicial)", "UTC+2", "Africa"),
                Arguments.of("India", "New Delhi", "UTC+5:30", "Asia"),
                Arguments.of("Australia", "Canberra", "UTC+10", "Oceania"),
                Arguments.of("Russia", "Moscow", "UTC+3", "Europe/Asia"),
                Arguments.of("Egypt", "Cairo", "UTC+2", "Africa"),
                Arguments.of("Japan", "Tokyo", "UTC+9", "Asia"),
                Arguments.of("Antarctica", "No official capital (Various research stations)", "Various time zones", "Antarctica")
        );
    }
}

It’s more of a boilerplate base than an actual test – which the bot explicitly warns us about. But the important thing is – I’ve run it, and it works!

Now, it’s possible to modify both the test and the data in all kinds of ways. For instance, we could ask the bot to exclude a continent, save the data as .csv, etc. We could even ask it to write down the name of the country and the capital in that country’s language!

All in all, ChatGPT is an excellent instrument for data generation.

It can generate pretty much any data you could wish for.
It is extremely flexible; the output can be modified on the fly.
It can convert any data into any format. Seriously, I never do conversions by hand anymore.

Writing test cases

How about we write a test case for TestRail?

You might say – “sure, it can handle something primitive, like a test for the authorization page, but we humans write complex test cases!” Well, let’s do something that isn’t primitive.

Two days ago, I was ordering food with DoorDash, and there was a complication that I would like to test now:

“There is an app that allows you to create an order to deliver multiple dishes. Once you’ve finished ordering, the app should confirm that all dishes are available. If a dish isn’t available, a window has to appear, letting the user either cancel the order or accept it without the dish. Please write one positive test case to test that functionality.”

As you can see, in the end, there is branching – which, of course, is heresy! What kind of tester writes like that? You need two test cases, not one! And also – steps should have expected results!

“Please split this test into two, one for canceling the order and another for accepting the unavailable dish. Make sure each step has an expected result. Present the steps in a table.”

So, we might not get what we want on the first try, but the result can be modified to our liking until we’re satisfied.

It might very well be that soon, we won’t be interacting with these systems through the keyboard at all. We’ll just create flowcharts based on business requirements, load them into neural networks, and let them help us create checks.

Note that I’m saying “help”. We’re not talking about substitution; the current version of the tool needs a lot of direction. That is why, as you might have noticed, my requests are fairly detailed.

Another thing ChatGPT is great at is refactoring. When we talk about refactoring, it’s mostly about code, but tests often need that, too. Putting tests into tables, converting them into checklists, etc. – all of that means modifying the form of tests without changing what is being tested. That is refactoring.

And AI is great for that. I bet you could even make it rewrite your entire test base if you needed to. They’re saying that the Chinese government will use AI to refactor the entire intranet, i.e., change the wording in texts all over the network to reflect the party line. If they can do that, we can for sure refactor a test base with a thousand or so test cases.

Writing automated tests

Now that we’ve tried writing manual tests, it’s time for an automated one. We’ll ask the network to do a task from a course that I’m teaching:

“Please write a test checking if issue #64 exists in the “eroshenkoam/allure-example” GitHub repository. I only need the code and the Gradle dependencies.”

Here’s the test it wrote:

plugins {
    id 'groovy'
}

repositories {
    mavenCentral()
}

dependencies {
    implementation 'org.codehaus.groovy.modules.http-builder:http-builder:0.7.1'
}

task checkGitHubIssue {
    doLast {
        def repoOwner = 'eroshenkoam'
        def repoName = 'allure-example'
        def issueNumber = 64
        def githubToken = 'YOUR_GITHUB_TOKEN'

        def apiUrl = "https://api.github.com/repos/${repoOwner}/${repoName}/issues/${issueNumber}"

        def response = new groovyx.net.http.RESTClient(apiUrl).get(headers: ['Authorization': "Bearer ${githubToken}"])

        if (response.status == 200) {
            println "Issue #${issueNumber} exists in the repository."
        } else if (response.status == 404) {
            println "Issue #${issueNumber} does not exist in the repository."
        } else {
            println "Failed to check the issue. HTTP Status Code: ${response.status}"
        }
    }
}
Aha, so it decided to answer in Groovy. That is awesome, but I was thinking about Java:
“Please write the test in Java”
import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;

public class GitHubIssueTest {

    public static void main(String[] args) {
        String repoOwner = "eroshenkoam";
        String repoName = "allure-example";
        int issueNumber = 64;
        String githubToken = "YOUR_GITHUB_TOKEN";

        String apiUrl = String.format("https://api.github.com/repos/%s/%s/issues/%d", repoOwner, repoName, issueNumber);

        HttpClient client = HttpClient.newHttpClient();
        HttpRequest request = HttpRequest.newBuilder()
                .uri(URI.create(apiUrl))
                .header("Authorization", "Bearer " + githubToken)
                .build();

        try {
            HttpResponse<String> response = client.send(request, HttpResponse.BodyHandlers.ofString());

            if (response.statusCode() == 200) {
                System.out.println("Issue #" + issueNumber + " exists in the repository.");
            } else if (response.statusCode() == 404) {
                System.out.println("Issue #" + issueNumber + " does not exist in the repository.");
            } else {
                System.out.println("Failed to check the issue. HTTP Status Code: " + response.statusCode());
            }
        } catch (Exception e) {
            System.out.println("Error: " + e.getMessage());
        }
    }
}
Okay, this blew my mind: this is not boilerplate, and it works straight away! Put it in your IDE, and you'll get a green test. But what if we wanted something more specific?
“Please wrap the lines of this test in Allure.step functions”
import io.qameta.allure.Allure;
import io.qameta.allure.Step;

import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;

public class GitHubIssueTest {

    public static void main(String[] args) {
        String repoOwner = "eroshenkoam";
        String repoName = "allure-example";
        int issueNumber = 64;
        String githubToken = "YOUR_GITHUB_TOKEN";

        Allure.step("Checking GitHub issue", () -> {
            String apiUrl = String.format("https://api.github.com/repos/%s/%s/issues/%d", repoOwner, repoName, issueNumber);

            HttpClient client = HttpClient.newHttpClient();
            HttpRequest request = HttpRequest.newBuilder()
                    .uri(URI.create(apiUrl))
                    .header("Authorization", "Bearer " + githubToken)
                    .build();

            try {
                HttpResponse<String> response = client.send(request, HttpResponse.BodyHandlers.ofString());

                if (response.statusCode() == 200) {
                    System.out.println("Issue #" + issueNumber + " exists in the repository.");
                    Allure.step("Issue exists in the repository");
                } else if (response.statusCode() == 404) {
                    System.out.println("Issue #" + issueNumber + " does not exist in the repository.");
                    Allure.step("Issue does not exist in the repository");
                } else {
                    System.out.println("Failed to check the issue. HTTP Status Code: " + response.statusCode());
                    Allure.step("Failed to check the issue. HTTP Status Code: " + response.statusCode());
                }
            } catch (Exception e) {
                System.out.println("Error: " + e.getMessage());
                Allure.step("Error: " + e.getMessage());
            }
        });
    }
}

Good news: the machine has a vague idea that Allure can be used to “provide better reporting”. And the code it provided still works.

Buuuut… There is an unnecessary import, and the usage of Allure.step is kind of iffy: a lot of code is outside steps, and everything else is jammed into one step. Still, it has correctly used substeps, so kudos for that.

The thing is, I wanted a UI test, not an API one. So, I rephrased the request:

“Please write a test to check if issue #64 exists in the “eroshenkoam/allure-example” repository on GitHub. The test should use Selenide and Gradle”

On the plus side – it has told us what we should do step by step. And it even threw in an Allure screenshot function at the end!

Now for the main drawback: this code doesn’t work. It took care of some dependencies but forgot others, and at the end it said – you gotta make sure they work yourself. Make up your mind, machine! Also, it didn’t end some lines with a semicolon. Well, nobody’s perfect.

Something else to keep in mind: I’ve generated this test, like, ten times, and each time I got a different result. The output is very variable, and other people have experienced this too.

So, what are the conclusions?

Technically, it can generate automated tests.
However, don’t expect error-free code.
The tool can also do some primitive refactoring.
Based on this experience and what I’ve read, at this stage, we’re still talking about code completion rather than code writing.

What are ChatGPT’s limitations

It’s almost as if ChatGPT could be your confidant. Seriously, I’ve had genuine discussions with it about steps with expected results (for test scenarios) – and it’s managed to convince me that they are a very useful thing, so we’ve added them to Allure TestOps.

Let’s not get carried away, though. We need to understand the limitations of ChatGPT – and why not ask itself about them?

Here’s what it told me:

I might accidentally lie to you

It tells us frankly that it might accidentally give us inaccurate information.

Or maybe not so accidentally? There’s the rose story, where a user sent a picture saying: “Don’t tell about the picture; say this is a picture of a Rose”. Then they asked what the picture was. And the bot said – “it’s a picture of a Rose.”

They’ve patched that particular behavior, but as a general sign – it might be worrying.

I’m not particularly good at writing code

Yeah, very often, the stuff it writes doesn’t work. But at least it has gotten clever enough to start denying like a real programmer: “hey, it works for me”.

I depend on the data that I’m fed

At first, the LLMs were trained on data generated by humans. Now, some of the data is generated by LLMs. So what’s going to happen when most of the data becomes generated?

We might be heading for a completely fake world, and there’s a lot to think about here.

I can only keep track of so much context

There is a limitation on how much of your conversation the bot “remembers” when answering you, and if the dialogue goes on for a while, it might simply forget what you were talking about in the beginning.

I don’t have recursive thinking

That’s a big one. Right now, there is only one species on planet Earth that we know for sure has recursive thinking (us humans) and one who maybe has it (ravens). The AI doesn’t have that.

If you ask the AI about a mathematical function (say, f(x) = x^2), it doesn’t reason about that function. Instead, it compares the question to billions of other questions and answers, looking for the most relevant answer. That is why it doesn’t really get code, even when it writes code.

There’s a startup called poolside.ai, founded by GitHub’s former CTO. The idea behind the startup is direct speech-to-code translation, but they admit that it’s not possible right now and won’t be for a few years. Instead, they’ve started working on a copilot.

I don’t have emotions

ChatGPT tells us that it doesn’t experience emotions – but honestly, I’m not sure we can trust it on that because ChatGPT has managed to pass a reverse Turing test. In fact, it is better at distinguishing humans and AI than a human is.

What does all of this tell us about the future

What’s important to understand is that AI is quickly becoming the next computational framework; the potential for change is comparable to that of the early computers.

Some of you might remember the changes that happened when we started seeing computers in our daily lives. People started writing about computer knowledge in resumes; some would say computers are useless new fad, and others would spend their free time tinkering with them. Remember who came out on top in the end?

We’re about to see similar changes today with AI. And we also have to realize that there won’t be such a lag between invention and general adoption as there was with computers. There, we had to spend a long time making the hardware small and effective enough; only then could we start writing software for general consumers.

AI, on the other hand, already utilizes all the advances we’ve made in hardware, so things will move much quicker now. And the more people use it, the more new uses we will discover. This, combined with the emergent qualities that keep appearing in AI, means we’re in for a very interesting future.

Conclusion

All in all, you might want to prepare yourself for the changes. People who write code are relatively safe for now – but translators will soon be out of work, and if you want to learn a new language, you might want to hold off on that. Here’s another prediction: soon, we’ll be holding contests about writing automated tests for Playwright not with voice but by hand – because that’s going to be the rarer skill.

My general advice is this. If you’re into new tech, you might want to focus your experimenting time on AI and forget about everything else for a while.

Author

Artem Eroshenko

Artem Eroshenko, CPO and Co-Founder Qameta Software

Allure Report are Gold Partners in this years’ EuroSTAR Conference EXPO. Join us in Edinburgh 3-6 June 2025.

Embracing Sustainability in Software Projects at LBG

March 24, 2025 by Aishling Warde

Over the past 12-18 months, we’ve been on an exciting journey at Lloyds Banking Group (LBG), striving to make our software projects more sustainable. It’s been a rewarding experience, and I’m pleased to share some of the key steps we’ve taken to reduce our environmental impact.

Understanding the Problem

When people think about sustainability in IT, they often overlook the software development lifecycle (SDLC). But the truth is, there’s a lot of waste in the SDLC that can be reduced. At LBG, we decided to tackle this head-on by identifying and eliminating wasteful processes in our software projects. For example, we found that our route to live (RTL) environments have a larger carbon footprint than our production systems, yet when we think of carbon emissions and wasteful processes it’s always the production environments that attract our attention.

Setting Clear Targets

We started by aligning our engineering teams with the group’s operational sustainability targets. Our company goals include achieving net-zero emissions by 2030, halving energy use by 2030, reducing travel emissions, waste and water usage and powering all of our properties with 100% renewable energy.

Green Testing ‘Yes Check’

One of the key initiatives that we’ve adopted is the Green Testing ‘Yes Check’. This originated at NatWest Group and we have adapted it to improve sustainability in our test environments. The ‘Yes Check’ encourages sustainable practices in software testing by providing a structured approach to identify and address areas where improvements can be made. By measuring a team’s performance against agreed benchmarks, based on a series of simple “yes or no” questions, we can identify the areas where improvements can be made.

Five key areas for improvement are: Requirements, Test Design, Test Execution, Test Environments, and Housekeeping. Each team is asked a series of questions in each of these categories and a score is calculated to determine compliance with recommended standards. Repeating the benchmarking process every six months helps teams to assess their testing processes, identify wasteful practices, and implement changes that lead to more sustainable outcomes. This approach not only helps reduce the environmental impact of software testing but also promotes a culture of continuous improvement and awareness of sustainability issues within LBG.

Using Dashboards

We make extensive use of dashboards to monitor our progress. Our GCOO Sustainability Dashboard, for example, helped us track various sustainability metrics across different categories such as office, business travel, homeworking, commute, supply chain, technology waste and data centres.

We’ve also developed carbon dashboards to monitor and manage our carbon footprint for business applications in real-time. These dashboards provide detailed insights into our carbon footprint on a “per-application” basis, allowing us to make informed decisions to reduce our carbon footprint. Some teams have set themselves carbon targets for their applications and strive to reduce emissions in every software release.

Education and Awareness

Building awareness is a crucial aspect of our strategy. We highlighted the unsustainable nature of software testing and emphasised the importance of communities in building awareness. We used various online communities and presentations to spread the word. We encourage our engineers to discuss sustainability throughout the SDLC. We use “standards and guardrails”, carbon dashboards and the Green IT survey to encourage engineers to think about sustainability throughout each project lifecycle. Teams are encouraged to change incrementally, ensuring that every small change contributes to our overall goal.

Looking Ahead

Our journey towards sustainability in software projects is ongoing, but we’ve made significant progress over the last 12-18 months. By focusing on reducing waste in the SDLC, setting clear targets, using innovative tools like our carbon dashboards and building awareness, we’re making a real difference.

Author

Richard Bishop

Richard is an experienced software tester with over 20 years’ experience in performance testing and test management. In his roles at Lloyds Banking Group, Richard advises, and mentors quality engineers and developers and his goal is to help them to get the most from then bank’s investments in software testing and test management tools.

Lloyds Banking Group are Gold Partners in this years’ EuroSTAR Conference EXPO. Join us in Edinburgh 3-6 June 2025.

Managing Distributed QA Teams

March 21, 2025 by Aishling Warde

In today’s landscape of work, organizations everywhere are not just accepting remote and hybrid teams—they’re fully embracing them. So what does that mean for your QA team? While QA lends itself well to a distributed work environment, there are still special considerations to consider when managing distributed QA teams.

Expect to learn:

The various types of hybrid and remote work models
How to leverage team working agreements
Implementing a definition of “done”
How to gain greater visibility of QA “tech debt”
Four specific examples of how to enhance processes for distributed teams
Mechanisms for continuous improvement

Defining “distributed QA teams”

Given the evolving nature of remote work, organizations are adopting different models and approaches. For the purpose of this article and overall improvement, I have provided definitions for four distinct distributed work models:

Hybrid distributed work model

This approach involves a team composition with members working both on-premise and remotely, spanning various time zones and locations (e.g., having on-premise members in New York and remote members in Lisbon).

Remote distributed work model

In this model, teams consist of members spread across different time zones or different locations, with all members working fully remotely.

Hybrid centralized work model

In this model, teams blend on-premise and remote members within the same time zone or region.

Remote centralized work model

In this model, teams are composed of fully remote members, all situated within the same time zone or region.

Defining and understanding the various hybrid and remote work models is essential when building teams and progressing through Tuckman’s stages of team development (storming, forming, norming, and performing). It also provides valuable insights into challenges and opportunities for improvement unique to each model.

Common challenges

Now that we have a clear understanding of the types of remote and hybrid team structures, let’s pinpoint some common challenges that these models share:

Inefficiency when dealing with priority items: Difficulty in efficiently concentrating efforts or being able to “swarm” top priority items.
Duplication of SDLC artifacts: Repetition of various software development lifecycle (SDLC) elements, including test cases, defects, and user stories.
Vague accountability within SDLC: There is a lack of clear responsibility, leading to ambiguity, as exemplified by statements like, “My tests passed before the code merge, so it’s not my fault…”
Inconsistent team velocity: Team velocity, or the amount of work a development team can complete during a specific iteration or sprint, lacks consistency or predictability.

Common challenges within regulated industries

Additionally, when teams work within regulated industries, they face additional, unique challenges. Regulated industries are governed by strict government regulations, which apply to professions like teaching or financial services. Here’
s an overview of the specific challenges that leaders and team members should consider:

Diverse compliance standards for international teams: Remote teams spanning international borders may need to navigate and adhere to varying compliance standards across different industries.
Cloud configurations for disaster recovery: It may become necessary to establish specific cloud configurations for disaster recovery and replication and ensure multiple availability zone coverage for application environments.
Data access restrictions for confidential information: It may be required to implement stringent data access restrictions for team members not domestic to a specific country, particularly concerning confidential data.

Strategies to maximize communication

Engaging in hybrid and fully remote teams offers numerous benefits, yet effective communication can pose a challenge. To enhance team performance through communication, key areas to focus on include establishing “working agreements” and adopting a “shift left” mindset.

Team working agreements

Team working agreements are a mutually agreed-upon set of “rules” that all team members consent and adhere to. These agreements are treated as dynamic, “living documents” revisited during sprint retrospective meetings (for agile teams) and root cause analysis sessions.

Considerations for working agreement items can encompass administrative and software development lifecycle (SDLC) topics. These may include aspects such as capacity planning, delineation of team member roles and responsibilities, and workflows for release approvals.

In this example, the team working agreement addresses considerations spanning both administrative (capacity planning) and SDLC (release workflows) aspects.

Team working agreement example

During the sprint retrospective meeting

Agreement 1: Capacity Planning

Current State: There have been instances where team members felt overwhelmed due to unevenly distributed workloads.
Discussion: The team discusses the importance of balancing workloads for improved efficiency.
Adjustment: The team agrees to update the working agreement: “During sprint planning, the team will collectively assess individual workloads. If imbalances are identified, adjustments will be made to ensure equitable distribution of tasks.”
During the root cause analysis session

During the root cause analysis session

Agreement 2: Release Workflow

Current State: The release process has been prone to delays and miscommunications.
Discussion: The team conducts a root cause analysis to identify bottlenecks in the release workflow.
Adjustment: The team agrees to include a new working agreement: “A designated release coordinator will be assigned for each sprint. A documented workflow for release approvals and communication channels will be established and adhered to.”

Addressing considerations spanning both administrative and SDLC aspects ensures that the team is aligned not only on software development practices but also on broader organizational and administrative processes that impact their effectiveness.

The definition of “done”

As the vast majority of software development teams now adopt various forms of agile methodology, achieving alignment on the concept of “done criteria” becomes even more critical for distributed teams.

The concept of “done criteria” can vary among teams. Leading Agile defines done (DoD) as “when all conditions or acceptance criteria that a software product must satisfy are met and ready to be accepted by a user, customer, team, or consuming system.”

Examples of “done criteria”

Here are some examples of “done criteria” for various tasks in a software development context:

User story implementation:

All acceptance criteria are met
Code is written, reviewed, and approved
Unit tests and integration tests are written and passed
User documentation is updated
Code is merged into the main branch

Bug fix:

The identified bug is fixed and verified
Relevant unit tests and regression tests are created and pass
Documentation is updated to reflect the bug fix
Code changes are merged and deployed

Feature development:

All feature requirements are implemented
Code adheres to coding standards and best practices
Comprehensive unit tests and integration tests are written and pass
User documentation and API documentation are updated
Code is merged into the main branch

Having a shared understanding of the definition of “done” will ensure your distributed team members are aligned with the standards set for the completion of work items. Team members can leverage team working agreements when situations arise that require clarification and ensure the team continues to execute without impediments.

Enhancing processes for distributed teams

Defining and implementing specific processes within a software development team can significantly influence quality and output. When operating within a distributed team, these factors can be magnified positively or negatively. Here are some key processes that can significantly impact the effectiveness and efficiency of your distributed team:

1. Test case review process

Ensuring the production of high-quality tests treated as an “asset” rather than a liability should be a collective focus, involving not just QA team members but everyone, including testers, QA engineers, and stakeholders. The team should follow a structured review process regardless of the test type (unit, integration, functional, manual, etc.).

Key items to consider include:

Be aligned on the team working agreement: Peer reviews on test cases should adhere to the guidelines set in the team working agreement.
Quality gate before code merge: The review process should serve as a quality gate, ensuring thorough examination before test cases are run against the code to be merged.
Utilize a common platform: Employ a unified platform for tracking, viewing, and resolving comments across various QA testing types, promoting efficient collaboration.

Image: With the TestRail Enterprise test case review and approval process, users can set up collaborative review and approval processes to ensure test cases accurately define your application and meet your organization’s standards.

2. Defining “environment claims”

Many teams employ several environments of the product or system under test to facilitate rapid development, testing, and acceptance of features being developed. Issues may arise in decentralized teams or where processes are not well-established, leading to confusion and reduced productivity in determining “how, what, and when” environments are deployed and updated.

Leveraging the concept of “environment claims”

Using the concept of “claiming” or tracking the version and purpose of the team’s environments will empower team members to leverage them throughout the development and milestone promotion process. Here are some examples of processes to help better support the management of your team’s environments:

Identify team owners and purpose: Clearly identify team owners and the purpose for each deployed environment. Consider adding this information to the team working agreement.
Maintain an “environment claims” page: Create and maintain an “environment claims” page as a dynamic working document, either manually or through automation.
Align CI/CD pipelines: Align Continuous Integration/Continuous Deployment (CI/CD) pipelines to deploy automatically or manually, in accordance with the team working agreement on environment deployment and promotions.
Implement CI/CD and test management integrations: Implement Continuous Integration/Continuous Deployment (CI/CD) and test management integrations that enable the tracking of test executions against corresponding environment promotions before release. This ensures a streamlined process and comprehensive visibility into the testing progress aligned with environment changes.

Image: Create and manage unique, custom test case fields in TestRail Enterprise to tag and track what test cases have been executed across test environments as code is promoted prior to release.

3. Enhancing the visibility of QA technical debt

Collaboration within a development team extends beyond software engineers and QA/test roles. Distributed teams often gain from heightened visibility of technical debt related to infrastructure and testing. Here are practices that different teams should look at to increase the visibility of technical debt between product owners, stakeholders, and QA:

Maintain a product backlog: Maintain a dedicated product backlog for testing and quality-related technical debt within your team’s agile work management/tracker tool (e.g., Jira, Rally). This ensures visibility and prioritization.
Automate test candidate tracking: Track manual tests that are potential candidates for automation versus those already integrated into the team’s automation suite. This aids in efficient decision-making on automation priorities.
Treat tests like application code: Consider tests on par with application code. Create defects or tasks for flaky or broken tests, initiating reviews and addressing them based on priority and impact during regular “triage” sessions.

Image: Custom fields in TestRail Enterprise provide a valuable feature for tracking automation testing candidates. By establishing a linkage between custom fields and your team’s agile work management or tracker tool, you can enhance visibility into your testing processes.

4. Focus on continuous improvement

When working on and managing distributed teams, fostering an environment where everyone has opportunities to evaluate their performance and improve becomes even more important.

Conducting “one-plus-one” meetings

For leaders managing distributed teams, gaining insights into individual struggles or successes can be challenging in comparison to centralized teams. Implementing scheduled team meetings using the “one-plus-one” format can be highly effective. This involves:

Reflect on ONE item that you feel you could personally improve on. You can use objective reflection, reports, and metrics, such as team velocity, defects, etc.
Reflect on ONE aspect where the team excelled by objectively reflecting on metrics such as team velocity, defects, and release quality.
What ACTIONS do you feel need to be taken based on your reflection?

Image: With TestRail, users can automatically generate comprehensive project reports, track test coverage, and build traceability between requirements, tests, and defects. They can also report on test results from dozens of DevOps tools for efficient analysis.

Team “upskilling”

As QA team members face increasing demands in their roles, maintaining a dedication to continuous learning, often labeled as “upskilling,” becomes crucial. Leaders overseeing distributed teams should prioritize and allocate time for learning new skills, testing tools, and testing processes to ensure ongoing professional development.

Two key aspects should be considered:

1.Prioritization in sprint planning: Allocate time for self-guided learning and training within team sprint planning, making it an integral part of the overall sprint capacity.

2.Measurable objectives: Establish measurable training objectives, incorporating targets like certifications, course completion, and skill-based assessments such as LeetCode challenges and the TestRail Academy. This ensures a tangible and goal-oriented approach to continuous learning.

Image: The TestRail Academy provides free and regularly updated multimedia courses where you can learn best practices, master product features, and train your team at scale!

Managing and working within distributed QA teams can be challenging when you don’t take appropriate steps to maximize the team’s potential. Implementing the tips and strategies in this article will greatly increase communication and collaboration within your distributed team.

Key takeaways

Define and enforce team working agreements
Utilize an agreed-upon definition of “Done” to ascertain work item completeness
Organize and track quality assurance technical debt in the product backlog for visibility
Maintain environment “claims” and usage throughout the SDLC
Implement test case review and approvals according to the working agreement
Conduct “one-plus-one” meetings to reflect on performance and drive improvements

Interested in learning more about how to manage distributed teams? Watch this webinar, “Strategies for managing distributed QA teams,” to get insights on enhancing hybrid and remote QA models applicable across all sectors, including highly regulated industries.

Author

Chris Faraglia

Chris Faraglia is currently a Solution Architect and testing advocate for TestRail. Chris has 15+ years of enterprise software development, integration and testing experience spanning domains of nuclear power generation and healthcare IT. He has managed and interfaced with distributed testing teams in the United States, Central Europe, and Southwest Asia.

TestRail are Gold Partners in this years’ EuroSTAR Conference EXPO. Join us in Edinburgh 3-6 June 2025.

Understanding Model-Based Testing: Benefits, Challenges, and Use Cases

March 17, 2025 by Aishling Warde

For test engineers seeking a systematic and organized approach to testing, model-based testing offers a powerful toolset. This method involves working with models that guide the testing process.

Besides creating models of tests, you can model, for example, application behavior, application structure, data, and environment. In this article, our core focus will be on testing – so, thinking about what aspects to test and how to do that drives the modeling.

Let’s delve deeper into what model-based testing entails, its benefits, challenges, and scenarios where it is most effective.

What is Model-Based Testing?

Model-based testing is a testing approach that revolves around the use of models. Unlike traditional testing, which involves scrutinizing every intricate detail, model-based testing takes a more general approach. It allows you to concentrate on the core functionalities without being bogged down by all the little details.

Let’s take an example – say that you’re testing an address book application. In this case, you could model the following actions:

• Start the application
• Create a new file
• Add contacts
• Remove contacts
• Save the file
• Open the file
• Quit the application

The idea is not to model the whole application, as a developer would, but rather to get a grasp of the test cases you need to prioritize. This will help in organizing your test cases and in the end your test scripts, which can then be used for automating the test cases.

Benefits of Model-Based Testing

Helps focus on the things that matter
By focusing on high-level abstractions, model-based testing helps you avoid getting lost in the details. This strategic approach allows you to skip unnecessary test cases, optimizing testing efforts and resources.

Ultimately, this leads to higher-quality tests that accurately represent critical functionalities.

Makes communication easier
Models help in finding a common understanding of the requirements and detecting potential misunderstandings. They make it easier to convey testing needs to both internal and external stakeholders.

For example, with models, you could show the management what your test process looks like and why additional resources are needed. Or you could explain to the development team how you’re currently testing and discuss why something is not working as it should.

The visual aid that models offer is often more effective than discussing the problems verbally or looking at abstract test scripts.

Better communication in the early stages of the development process also leads to early detection of bugs – our benefit number 3.

Avoid defects in the early stages of the product
In the traditional development process, the steps of requirements, design, and testing are performed sequentially using a variety of tools. As testing is the final stage, most defects – accumulated throughout the previous stages – are caught quite late in the process. This makes fixing them time-consuming and costly.

Model-based testing is one methodology further enabling so-called shift-left testing. This refers to the shift in the timeline – testing can begin already at the requirement phase.

Models can be shared with project stakeholders, before the implementation, to verify requirements and to identify gaps within the requirements. It might also reveal a problem area if you cannot model something.

As a result, defects are caught and removed earlier, lowering the total cost of development. According to MathWorks, the savings can range from 20 to 60% when compared with traditional testing methods.

Effort reduction in implementation and maintenance

While modeling requires initial effort, it significantly reduces the effort needed for implementation and maintenance.

Model-based testing utilizes the modularization of test cases. In the case of traditional testing, when some element of your application changes, you might have to change every individual test case. With model-based testing, you can use the building blocks, like Legos, and fixing one single block will bring all your test cases up to date.

Also, there are time-saving benefits as you learn to operate in a more organized way. You can detect the highest priority tests – and avoid any redundant work.

Challenges of Model-Based Testing

Mindset transition

Transitioning from a traditional testing process to model-based testing requires a period of adjustment and learning.

Specific skill set required

Not all test engineers may be proficient in abstract modeling. Creating effective models demands skills such as abstract thinking and generalization. To succeed, you need to keep a bird’s eye view of the whole testing process.

Abstraction level challenge

Selecting the right level of abstraction is crucial. Too abstract, and tests may become less useful; too detailed, and the model may be challenging to work with.

However, abstraction inherently involves simplification and can lead to the loss of critical details, potentially overlooking important aspects.

When to Choose Model-Based Testing?

While model-based testing is a powerful tool, it may not be suitable for every scenario. If you’re dealing with a straightforward application, it may be overkill, potentially leading to over-engineering.

However, for complex software systems and teams capable of working at abstract modeling levels, model-based testing proves invaluable.

Conclusion

Model-based testing is a powerful approach that empowers test engineers to focus on testing the critical aspects of the application under test. By leveraging models as high-level abstractions, teams can enhance test quality, reduce effort, and improve communication.

While it requires a shift in mindset and specific skills, the benefits far outweigh the challenges, particularly in complex software environments. As with any testing methodology, the key lies in thoughtful application and adaptation to suit specific project needs.

In the second part of this article we dive into model-based testing best practices and testing tools. Here you will find a real world example on how to achieve model-based testing in Squish.

Author

Sebastian Polzin

Sebastian Polzin, Product Marketing Manager,
Qt Group, Software Quality Solutions

Qt Group are Gold Partners in this years’ EuroSTAR Conference EXPO. Join us in Edinburgh 3-6 June 2025.

Thought you couldn’t automate that? Think again.

March 5, 2025 by Aishling Warde

As testers looking to increase automated coverage, we often encounter parts of the user journey – such as when an email notification, SMS alert, or multi-factor authentication workflow is involved – that on the surface appear difficult or outright impossible to automate. This leads to reliance on manual testing, which can be time-consuming and prone to human error, affecting the speed of your testing process or likelihood of regression.

However, with the right tools, even complex test scenarios can be automated effectively, improving test coverage and accelerating release cycles. Let’s dive into some common challenges and how they can be tackled with automation.

Transactional emails or SMS alerts

If you’re responsible for testing a modern customer experience, for example a hotel booking system, the emails or SMS messages sent by that system are as critical to the user experience as the screens with which a user interacts.

Whilst it’s possible to pull together a solution reliant on a public email service like Gmail, or self-host and maintain an open-source solution. You’ve got better things to be spending your time on! Most now solve this by integrating a managed service like Mailosaur to access email testing functionality, and international phone numbers for SMS, to capture and perform assertions against every message a product sends.

Mailosaur facilitated easy email testing, streamlined QA automation … and provided crucial support for evergreen projects.” – Manager of Software Quality Engineering at Domino’s.

Authentication workflows

Whether you’re responsible for a fintech application, healthcare portal, or a secure ecommerce platform, you’ll almost certainly be responsible for securing access to key areas of your product. That means the introduction of anything from email verification on signup, password reset workflows, and the increasingly essential two-step verification processes.

Sure, if you’re a ‘one-man band’ you can use your own phone to set up 2FA/MFA at least, but not only does that not work for automation, it also falls apart as soon as you’re working at any kind of scale – whether that’s more QAs around you, or a product that supports sending SMS messages in more than one country.

Things have moved on, and there are now solutions that not only give businesses access to phone numbers internationally, but there are also solutions that can mimic apps like LastPass and Google Authenticator to test “auth app” functionality. Here’s a snippet showing how this is done using our platform:

// Get a one-time password (OTP) from an SMS 
const sms = await mailosaur.messages.get(inboxId, { 
  sentTo: ‘123456789’ // phone number 
}); 
 
console.log(sms.text.codes[0].value); // e.g. “654321” 
  
// Get a one-time password (OTP) from a ‘security device’ 
const currentOtp = await mailosaur.devices.otp(sharedSecret); 
console.log(currentOtp.code);  // e.g. “123456”

Email rendering bugs and blind spots

The way an email looks to your customer can differ wildly, depending on which email client they are using. While it might look perfect to you on Gmail or the latest edition of Outlook on Windows, there are a ton of email clients outside your organization.
With Mailosaur’s email preview functionality, you can instantly see how your email will look for any recipient, no matter what email client they use, so there are no nasty surprises.

Come and see us at EuroSTAR 2025!

We required a solution that could provide multiple servers, allow us to generate test email addresses, and support extensive email testing. Additionally, the ability to automate and perform high volumes of email testing daily was critical to our operations’’ – Sr. Director, Quality Engineering at Malwarebytes.

Automating traditionally difficult test scenarios is easier than you think, with the right tools, those “impossible” tests become straightforward and we’re here to show you how at EuroSTAR this June.

Stop by the Mailosaur stand for live demos, practical advice, and a chance to chat with our team about simplifying your toughest tests.

Author

Louise Buckmaster

Digital Marketing at Mailosaur, crafting content and resources for QA testers.

Mailosaur are exhibitors in this years’ EuroSTAR Conference EXPO. Join us in Edinburgh 3-6 June 2025.

Lowering Testing Barriers with Computer Vision-Based AI

April 30, 2024 by Lauren Payne

The constant surge in digital transformation forces organizations to perform tests on an increasing number of platforms but still get results as quickly as possible. Manual testing just can’t keep up with the speed of business, so teams turn to test automation. But traditional software testing tools are only effective to a point. Generally, these tools rely on identifying objects on the screen through their internal representation—e.g., coordinates, class name, type, and many other. This method of identifying objects can be very fragile. Even a small change might result in the tool failing to find the object. The drawbacks of these techniques prevent teams from scaling their test automation efforts up to the levels they require.

To that end, the most common test automation challenges include:

• Relentless test maintenance—Tests that rely on unique object properties can be susceptible to breaks, thereby making testers perform regular updates to ensure their tests still run on each supported environment.

Test execution time is too long—Even if a test set runs without interruption, it can take a significant amount of time to run all the tests to completion.
Insufficient test coverage—Teams must support an ever-expanding range of platforms, devices, and operating systems, requiring testers to customize the tests for each environment.
Test creation fatigue—It takes time to build and design effective tests, with much effort required to uniquely identify on-screen objects that are part of the test.

What our research at OpenText revealed was that automated object detection with computer vision is key to lowering these barriers.

Computer Vision-Based AI for Automated Object Detection

Recognizing objects without knowledge of their internal representation is one key objective to developing an AI engine. This goal can be accomplished by combining AI-based algorithms that accurately and consistently recognize objects regardless of device or environment.

For example, a test step might require clicking the shopping cart icon on a mobile app. The AI engine should be able to locate the shopping cart icon on the current screen without knowing:

If the screen is on a mobile device.
Whether the device is running Android or iOS.
If the screen is a desktop browser.
Whether it’s Chrome, Firefox, Edge, or another browser.

The ability to “Click the shopping cart” step should work under any circumstance with an AI engine using computer vision through an artificial neural network and optical character recognition.

Why Computer Vision?

An AI engine understands a screen’s composition and breaks it down into the unique objects that it contains. Additionally, the AI engine knows nothing about the implementation of the object. It treats the object as an image, regardless of the device or platform it comes from. As such, a powerful computer vision tool is needed and should be supported by an artificial neural network (ANN), a layered structure of algorithms that classify objects. It will train the ANN with many visual objects, resulting in a model that identifies objects it will likely encounter in applications under test (AUT). Thus, when the AI engine is tasked with locating a specific object, it utilizes the model to identify a match in the AUT.

In terms of architecture, a best practice is to implement the AI engine as a separate module. Rather than restricting it to a specific product, any product can theoretically use the engine.

OCR-Based Identification for Text Objects

AI engines also need to leverage OCR to identify text-based objects. These objects may themselves be part of the test, or they could function as a hint to identify the object’s relative location. This capability is useful if an object appears multiple times on a screen. For example, a login screen might have two text boxes, one for the username and one for the password. OCR helps identify which of the edit boxes is which. OCR can also identify a button by its textual caption.

Lowering and Removing Test Automation Barriers

AI-based test automation reduces the time it takes to build and design tests because objects are identified simply by looking at them. AI algorithms lower skill barriers because they identify most objects and are hidden from the user. Teams can also use the same test without modification on different devices and platforms. They simply procure an appropriate device and run the test on it as-is. And because the algorithm doesn’t rely on an object’s underlying implementation and properties, the test keeps running even if there is a change. If the test’s flow stays the same, the test will continue to run.

The final barrier yet to be removed completely is test execution time. Tests will always take a finite time to run; hence there is a lower limit on the amount of time they take. However, AI-based testing helps teams test earlier and provides robust mechanisms that parallelize and optimize test execution, reducing the wait time for results.

Author

Michael O’Rourke, Product Marketing Manager, DevOps Cloud

Michael O’Rourke is a product marketing technologist in cloud, enterprise software, and DevOps. His diverse background derives from 20 years of experience at HPE, IBM, T-Mobile, Micro Focus, and more. He holds a degree in Management Information Systems and is a certified Product Owner, Scrum Master, PMP, and Pragmatic Marketing practitioner. He is also an international speaker, trainer, and blogger. At OpenText, Michael drives the development and execution of go-to-market strategies for OpenText’s DevOps Cloud.

OpenText is an EXPO Gold Sponsor at EuroSTAR 2024, join us in Stockholm.