Back to blog
Articles
Articles
September 26, 2023
·
6 min read

What Is The Future Of Prompt Engineering?

September 26, 2023
|
6 min read

Latest content

Customer Stories
4 min read

How Infobip Generated 220+ Knowledge Articles with Gen AI For Smarter Self-Service and Better NPS

Partnering with HumanFirst, Infobip generated over 220 knowledge articles, unlocked 30% of their agents' time, and improved containment by a projected 15%.
September 16, 2024
Articles
7 min read

Non-Technical AI Adoption: The Value of & Path Towards Workforce-Wide AI

Reviewing the state of employee experimentation and organizational adoption, and exploring the shifts in thinking, tooling, and training required for workforce-wide AI.
September 12, 2024
Articles
6 min read

AI for CIOs: From One-Off Use to Company-Wide Value

A maturity model for three stages of AI adoption, including strategies for company leaders to progress to the next stage.
September 12, 2024
Tutorials
4 min read

Building Prompts for Generators in Dialogflow CX

How to get started with generative features.
August 15, 2024
Announcements
3 min read

HumanFirst and Infobip Announce a Partnership to Equip Enterprise Teams with Data + Generative AI

With a one-click integration to Conversations, Infobip’s contact center solution, HumanFirst helps enterprise teams leverage LLMs to analyze 100% of their customer data.
August 8, 2024
Tutorials
4 min read

Two Field-Tested Prompts for CX Teams

Get deeper insights from unstructured customer data with generative AI.
August 7, 2024
Tutorials
5 min read

Optimizing RAG with Knowledge Base Maintenance

How to find gaps between knowledge base content and real user questions.
April 23, 2024
Tutorials
4 min read

Scaling Quality Assurance with HumanFirst and Google Cloud

How to use HumanFirst with Vertex AI to test, improve, and trust agent performance.
March 14, 2024
Customer Stories
4 min read

How Infobip Generated 220+ Knowledge Articles with Gen AI For Smarter Self-Service and Better NPS

Partnering with HumanFirst, Infobip generated over 220 knowledge articles, unlocked 30% of their agents' time, and improved containment by a projected 15%.
September 16, 2024
Articles
7 min read

Non-Technical AI Adoption: The Value of & Path Towards Workforce-Wide AI

Reviewing the state of employee experimentation and organizational adoption, and exploring the shifts in thinking, tooling, and training required for workforce-wide AI.
September 12, 2024
Articles
6 min read

AI for CIOs: From One-Off Use to Company-Wide Value

A maturity model for three stages of AI adoption, including strategies for company leaders to progress to the next stage.
September 12, 2024

Let your data drive.

Articles

What Is The Future Of Prompt Engineering?

COBUS GREYLING
September 26, 2023
.
6 min read

The skill of Prompt Engineering has been touted as the ultimate skill of the future. But, will prompt engineering be around in the near future? In this article I attempt to decompose how the future LLM interface might look like…considering it will be conversational.

Introduction

A year ago, Sam Altman was asked the following question, how will most users interact with Foundation Models in five years time?

Sam Altman responded saying that he does not think we will be using prompt engineering in five years time. He highlighted two main modalities of input, voice or text. And a complete unstructured UI where natural language is the input and the model acting on the instruction of the human.

To some degree this statement is jarring due to the fact that so much technology and innovation is based on the basic principle of prompt engineering.

One can argue that Prompt Engineering is natural language structured in a certain way. But it seems like what the CEO of OpenAI is referring to, are truly unstructured and highly intuitive interfaces.

The best way to try and under stand the future of LLM interfaces is to start by breaking down the use-cases.

There are two main use-cases; personal and enterprise implementations. Personal use or personal digital assistants currently in circulation are HuggingChat, ChatGPT and Cohere Coral.

What interests me most is highly scaleable enterprise implementations.

Considering enterprise use-cases, the two main approaches for manipulating the LLM is [1] fine-tuning and [2] injecting contextual reference (RAG) data into the prompt at inference (RAG). One approach does not necessarily replace the other. Model Fine-Tuning changes the behaviour and response of the LLM. RAG supplements the user’s input with a contextual reference.

The Problem Of Context & Ambiguity

One of the main challenges of Prompt Engineering is problem statement and problem formulation. Translating a requirement existing in thought, into a text request.

Taking a step back, with traditional chatbots establishing context is very important. Context is first established by classifying the user input according to one or more intents. Further context is established via previous conversations, API calls to CRM systems, etc.

Context also largely depend on time and place, and what our spatial reference is at the time of asking the question.

Apart from contextual awareness, ambiguity is also a challenge. Considering the example below, this form of ambiguity is easy for us as humans to decode; but traditionally harder for NLU/chatbots.

However, consider how well OpenAI answers the question below via the gpt-3.5-turbo model:

But there is ambiguity which is impossible to resolve with multiple meanings; an example of a sentence which is truly ambiguous and requires disambiguation is: I saw Tom with binoculars.

Structured Conversation

Even-though Sam Altman’s prediction speaks of the interface being more natural and less prompt engineered, there is an interesting phenomenon taking place in terms of LLM input and output.

We saw with OpenAI the Edit and Complete modes being marked as legacy (scheduled for deprecation) and the chat completion mode being set as a de facto standard.

I always argue that complexity needs to be accommodated somewhere, either the UX/UI is more complex…or the user is given an unstructured natural language interface; which in turn then necessitates complexity on the solution side.

With ChatML roles are defined and a definite structure is given to the input to the LLM. As seen in the code example below. Hence the input mode is more conversational, but there is underlying structure imposed which needs to be adhered to.

Conclusion

  • Intent, Context, ambiguity and disambiguation are all part and parcel of any conversation.
  • With human language interface, context and contextual awareness will always be important.
  • Ambiguity exists, and a level of disambiguation is part and parcel of human conversation. In human conversation we continuously use disambiguation establish meaning and intent.
  • Disambiguation can be automated to some degree with in-context auto-learning. This is the process of the conversational UI understanding how to disambiguate in various scenarios based on user input into disambiguation menus.
  • For instance, asking a LLM to generate five options, and you as the user select the best generated answer, is a form of disambiguation.
  • Enterprise specific implementation will be domain specific and will demand constraints, with a level of out-of-domain detection.
  • Problem or request decomposition will aways be important to create a chain of thought reasoning process.
  • Fine-Tuning will set model behaviour and RAG will supply inference specific context.
  • RAG creates a contextual reference for the LLM to use during inference.
  • Data Management will always be part of LLM applications.
  • The principle of Soft Prompts should be kept in mind. Soft Prompts are created during the process of prompt tuning. Unlike hard prompts, soft prompts cannot be viewed and edited in text.Prompts consist of an embedding, a string of numbers, that derives knowledge from the larger model.

I’m currently the Chief Evangelist @ HumanFirst. I explore & write about all things at the intersection of AI & language; ranging from LLMs, Chatbots, Voicebots, Build Frameworks, natural language data productivity suites & more.

Subscribe to HumanFirst Blog

Get the latest posts delivered right to your inbox