Year End Offer - Save up to 50% on tickets - Book Now

Track Talk W5

Unmasking AI: Is Artificial Intelligence Truly Thinking?

Benjamin Johnson Ward

11:30 - 12:15 CEST Wednesday 4th June

As AI, especially modern large language models, integrates into critical aspects of our lives, a pressing question arises: Are these systems truly reasoning, or merely performing pattern matching? This talk examines the nature of AI decision-making and its implications for quality and robustness.

We’ll consider reasoning from a human perspective, contrasting it with current AI models. Tracing AI’s evolution from symbolic logic to today’s LLMs, highlighting the shift from rule-based reasoning to statistical models based on data. Notably, research suggests that while LLMs can emulate reasoning patterns, they may lack genuine understanding—a concept known as “syntactic manipulation without semantic comprehension.”

But does this distinction matter? Is an AI’s lack of true reasoning fundamentally different from human reasoning errors? We’ll explore how these differences impact the quality and fairness of AI systems, examining studies and methods to assess AI reasoning and implications for applications.

One critical factor we will consider, is training data contamination, which can cause AI models to “cheat” by memorizing answers instead of reasoning- misleading us into believing they possess deeper understanding. This directly impacts the quality and trustworthiness of AI outputs.

We’ll delve into practical approaches like adversarial testing, counterfactual evaluations, and detecting training data contamination, to uncover biases, model drift, and weaknesses in robustness.

With AI technology rapidly evolving, we’ll consider how future advancements toward human-like reasoning could impact testing and quality assurance practices.

Attendees will learn the key differences between human reasoning and AI’s pattern recognition, and their effect on application quality and fairness. They will gain practical testing approaches to assess AI quality—bias, drift, robustness—and detect training data contamination, evaluating if the absence of true reasoning affects risks. Moreover, they will learn how to refine testing strategies to remain effective as AI technologies evolve.