Target levels of intent recognition are reached faster by verifying intents before deployment than by adopting a corrective strategy after the fact.
Intro
Each organisation has a benchmark for successful intent recognition for its chatbot or voicebot. The percentage of successfully recognised intents, or conversely the percentage of none-intents, is often part of the main dashboard of tracked metrics.
Firstly, it needs to be noted that there will always be a certain percentage of none-intents.
Something to keep in mind: a digital assistant is linked to an organisation and addresses a finite number of products and services. Hence out-of-domain queries will occur and may register as none-intents.
For instance, at a large mobile operator, our aim was to limit the percentage of none-intents to < 10%.
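A none-intent rate like this is typically enforced with a confidence threshold. The sketch below (pure Python, with hypothetical threshold values and classifier scores) routes low-confidence predictions to a "None" intent and measures the none-intent rate against a target:

```python
# Minimal sketch: fall back to a "None" intent below a confidence threshold
# and track the none-intent rate. Threshold and scores are illustrative
# assumptions, not values from the article.
NONE_INTENT = "None"
CONFIDENCE_THRESHOLD = 0.6   # assumption: tuned per deployment
TARGET_NONE_RATE = 0.10      # e.g. the < 10% aim mentioned above

def resolve_intent(prediction: tuple) -> str:
    """Return the predicted intent, or the None intent below the threshold."""
    intent, confidence = prediction
    return intent if confidence >= CONFIDENCE_THRESHOLD else NONE_INTENT

def none_rate(predictions: list) -> float:
    """Fraction of a batch that resolves to the None intent."""
    resolved = [resolve_intent(p) for p in predictions]
    return resolved.count(NONE_INTENT) / len(resolved)

# Hypothetical classifier output: (intent, confidence) pairs.
batch = [("billing", 0.92), ("roaming", 0.55), ("billing", 0.81), ("upgrade", 0.88)]
rate = none_rate(batch)  # 1 of 4 falls below the threshold -> 0.25
```

Tracking this rate over time shows whether the dashboard metric is trending toward or away from the target.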
The Challenge
Text classification and label creation are standard procedures in the AI world. The challenge with digital assistants, though, is that classifying user utterances cannot be an asynchronous process; it must be synchronous.
Live conversations need to be classified (assigned to intents) in real time as the conversation unfolds.
Hence the chatbot needs its classifications (intents) preloaded, with a good sense of the ambit user conversations might cover.
Ground-Truthed Intents
Intent classification is best performed using a corpus of text data, ideally actual customer conversations or utterances; transcribed audio also works.
This data is then grouped into semantically similar clusters; each cluster constitutes an intent and can be assigned a label (also referred to as an intent name).
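The clustering step can be sketched as follows. This is a deliberately minimal, pure-Python illustration using token overlap (Jaccard similarity) on hypothetical utterances; production pipelines would use sentence embeddings, but the greedy grouping logic is the same:

```python
# Minimal sketch (assumed data): greedily group utterances whose token overlap
# with a cluster's seed utterance exceeds a threshold. Each resulting group is
# a candidate intent cluster for a human to review and label.
def jaccard(a: set, b: set) -> float:
    """Similarity of two token sets: |intersection| / |union|."""
    return len(a & b) / len(a | b)

def cluster_utterances(utterances: list, threshold: float = 0.3) -> list:
    clusters = []
    for text in utterances:
        tokens = set(text.lower().split())
        for cluster in clusters:
            seed = set(cluster[0].lower().split())
            if jaccard(tokens, seed) >= threshold:
                cluster.append(text)  # close enough to an existing cluster
                break
        else:
            clusters.append([text])  # no close cluster -> start a new one
    return clusters

utterances = [
    "I want to cancel my subscription",
    "cancel my subscription please",
    "how do I reset my password",
    "reset my password is not working",
]
clusters = cluster_utterances(utterances)  # two clusters of two utterances each
```

Each cluster then gets a human-assigned intent name, e.g. `cancel_subscription` and `reset_password`.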
These labelled intents can be considered ground truth in terms of intent coverage. This is also an effective way to solve for the long tail of the intent distribution.
Subsequently, a machine-learning process can apply a "weak supervision" approach, where new text data is automatically assigned to the ground-truthed intents.
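A weak-supervision pass of this kind can be sketched as a nearest-match assignment. The ground-truth examples, intent names, similarity floor, and the `candidate_new_intent` bucket below are all illustrative assumptions:

```python
# Minimal sketch (assumed data): weakly label new utterances by assigning each
# to the ground-truthed intent with the highest token overlap; anything below a
# similarity floor is flagged for human review as a potential new intent.
GROUND_TRUTH = {  # intent label -> example utterances (hypothetical)
    "cancel_subscription": ["cancel my subscription", "I want to cancel"],
    "reset_password": ["reset my password", "I forgot my password"],
}

def best_intent(utterance: str, floor: float = 0.25) -> str:
    """Assign the closest ground-truthed intent, or flag a candidate new intent."""
    tokens = set(utterance.lower().split())
    scores = {}
    for label, examples in GROUND_TRUTH.items():
        vocab = set(" ".join(examples).lower().split())
        scores[label] = len(tokens & vocab) / len(tokens | vocab)
    label = max(scores, key=scores.get)
    return label if scores[label] >= floor else "candidate_new_intent"

best_intent("please cancel my subscription")  # matches cancel_subscription
best_intent("do you sell gift cards")         # out of domain -> flagged for review
```

The flagged utterances feed the human-in-the-loop review described next.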
Considering the image above, key elements of data labelling are:
Human-In-The-Loop methodology
Accelerated AI-Assisted latent space
Intelligent intent detection and management at scale
Intent splitting, merging, hierarchical or nested intents, deactivation of intents
Detecting intent confusion and disambiguation
Setting intent granularity and cluster sizes
New utterances which are not related to an existing intent are clustered into separate groupings and marked as new, hence constituting new intents.
Unfortunately, most chatbot implementations do not follow a data-centric approach to NLU Design: intents are deduced from business requirements rather than real-world customer conversations.
Added to this, training data is often synthetically produced or simply thought up.
Subsequent to the chatbot launch, a catch-up process ensues in which focus is placed on none-intents.
This negative approach misplaces the focus on none-intents (the conversations customers do not want to have) instead of placing it where it belongs: establishing ground-truthed intents, and hence the conversations customers want to have.
I’m currently the Chief Evangelist @ HumanFirst. I explore and write about all things at the intersection of AI and language: NLU design, evaluation and optimisation, data-centric prompt tuning, and LLM observability, evaluation and fine-tuning.