Author

Cobus Greyling

Cobus Greyling is passionate about exploring the intersection of AI and language. From language models and AI agents to agentic applications, development frameworks, and data-centric productivity tools, he shares insights and ideas on how these technologies are shaping the future.

Chief Evangelist

Kore.ai

Published Jun 12, 2025

4 Min Read

AI agents, RAG, and agentic retrieval for enterprises

Published Jun 25, 2025

4 Min Read

How MCP enables smarter, adaptive AI agent integration

Published Jun 17, 2025

4 Min Read

Agentic RAG is the next step in smarter enterprise AI

Published May 09, 2024

4 Min Read

Putting AI to work

Published May 10, 2024

4 Min Read

The case for an AI productivity suite

Published May 14, 2024

4 Min Read

Rapid development of intelligent generative AI APIs

Published May 17, 2024

4 Min Read

The large language model landscape — Version 5

Published May 22, 2024

4 Min Read

Three considerations for private open-source LLM instances

Published May 21, 2024

4 Min Read

A short history of chatbots

Published May 28, 2024

4 Min Read

TinyLlama is an open-source small language model

Published Jun 04, 2024

4 Min Read

Five levels of AI agents

Published Jun 06, 2024

4 Min Read

Data design for fine-tuning LLM long context windows

Published Jun 03, 2024

4 Min Read

LLM agents vs. chains: Differences & when to use each

Published Jun 14, 2024

4 Min Read

Teaching LLMs to say, “I don’t know”

Published Jun 10, 2024

4 Min Read

LLM performance over time & LLM task contamination

Published Jun 12, 2024

4 Min Read

Improving text embeddings with LLM generated synthetic data

Published Jun 16, 2024

4 Min Read

Active prompting with chain-of-thought for large language models

Published Jun 19, 2024

4 Min Read

Prompt pipelines

Published Jul 01, 2024

4 Min Read

Small & medium-sized enterprises enter the conversational AI arena

Published Jul 03, 2024

4 Min Read

Meta taxonomy of large language model correction & refinement

Published Jul 07, 2024

4 Min Read

Large language model (LLM) SWOT analysis

Published Jul 04, 2024

4 Min Read

Retrieval-Augmented generation (RAG) vs LLM fine-tuning

Published Jul 10, 2024

4 Min Read

Visualise & discover RAG data

Published Jul 12, 2024

4 Min Read

The new corner store is digital

Published Jul 12, 2024

4 Min Read

AI-Assisted UI design with AI for service

Published Jul 12, 2024

4 Min Read

Automate your contact center faster with AI for service

Published Jul 12, 2024

4 Min Read

Automation and orchestration with AI for service

Published Jul 12, 2024

4 Min Read

Introducing effortless no-code conversational UI onboarding

Published Jul 12, 2024

4 Min Read

The corner store can have a chatbot

Published Jul 12, 2024

4 Min Read

Helping SMBs build conversational AI

Published Jul 12, 2024

4 Min Read

Making conversational AI accessible for small and medium organisations

Published Jul 26, 2024

4 Min Read

No-code deployment and orchestration of open-sourced foundation models

Published Jul 22, 2024

4 Min Read

Prompt-RAG: Vector embedding free retrieval-augmented generation

Published Aug 12, 2024

4 Min Read

Corrective RAG: Boosting response quality

Published Aug 14, 2024

4 Min Read

Adding noise improves RAG performance

Published Aug 29, 2024

4 Min Read

The case for small language models

Published Aug 30, 2024

4 Min Read

The shifting vocabulary of AI

Published Sep 05, 2024

4 Min Read

Fine-Tuning or RAG?

Published Sep 19, 2024

4 Min Read

LLM drift, prompt drift & cascading

Published Sep 23, 2024

4 Min Read

The evolution of grounding & planning in AI agents

Published Sep 20, 2024

4 Min Read

Beyond LLMs: The shift to smaller, smarter AI models

Published Sep 26, 2024

4 Min Read

Please stop saying long context windows will replace RAG

Published Oct 14, 2024

4 Min Read

Small language models: Purpose & potential

Published Oct 04, 2024

4 Min Read

Self-Reflective Retrieval-Augmented Generation (SELF-RAG)

Published Sep 28, 2024

4 Min Read

Prompt-RAG

Published Oct 15, 2024

4 Min Read

Evaluating LLM voting under scaling laws

Published Oct 21, 2024

4 Min Read

RAT — Retrieval Augmented Thoughts

Published Oct 16, 2024

4 Min Read

Chain-of-Instructions (CoI) fine-tuning

Published Oct 27, 2024

4 Min Read

DRAGIN: Dynamic RAG based on real-time information needs of LLMs

Published Oct 24, 2024

4 Min Read

A study comparing RAG & Fine-Tuning for knowledge base use-cases

Published Oct 30, 2024

4 Min Read

Retrieval Augmented Fine-Tuning (RAFT)

Published Oct 30, 2024

4 Min Read

Large language models excel at In-Context Learning (ICL)

Published Nov 11, 2024

4 Min Read

FIT-RAG

Published Nov 01, 2024

4 Min Read

Adaptive-RAG

Published Nov 15, 2024

4 Min Read

FaaF: Facts as a function for evaluating RAG

Published Nov 18, 2024

4 Min Read

Agentic Search-Augmented Factuality Evaluator (SAFE) for LLMs

Published Nov 03, 2024

4 Min Read

Challenges in adopting RAG solutions

Published Nov 14, 2024

4 Min Read

Disambiguation: Dynamic context for effective RAG question suggestions

Published Nov 21, 2024

4 Min Read

Data design for fine-tuning to improve small language model behaviour

Published Nov 21, 2024

4 Min Read

Improve conversational UIs using social intelligence

Published Nov 25, 2024

4 Min Read

Can small errors disrupt RAG pipelines?

Published Jan 06, 2025

4 Min Read

DialogGPT

Published Jan 07, 2025

4 Min Read

How should large language models be evaluated?

Published Jan 08, 2025

4 Min Read

OPRO explained: Smarter prompt design

Published Jan 10, 2025

4 Min Read

LLM alignment, hallucination & misinformation

Published Feb 10, 2025

4 Min Read

As-Needed decomposition & planning using large language models (ADaPT)

Published Jan 13, 2025

4 Min Read

Evaluating LLMs: What, where, and how

Published Jan 17, 2025

4 Min Read

Chain of Empathy Prompting (CoE)

Published Jan 15, 2025

4 Min Read

What are LLMs good at & when can LLMs fail?

Published Jan 20, 2025

4 Min Read

Are emergent abilities in LLMs inherent or merely In-Context Learning?

Published Jan 22, 2025

4 Min Read

LLM hallucination index

Published Jan 24, 2025

4 Min Read

Knowledge-Driven Chain-of-Thought (KD-CoT)

Published Jan 27, 2025

4 Min Read

Chain-Of-Note (CoN) retrieval for LLMs

Published Jan 31, 2025

4 Min Read

OpenAI structured JSON output with adherence

Published Jan 30, 2025

4 Min Read

The Chain-Of-X phenomenon in LLM prompting

Published Feb 05, 2025

4 Min Read

The anatomy of Chain-Of-Thought prompting (CoT)

Published Feb 03, 2025

4 Min Read

Contrastive Chain-Of-Thought prompting

Published Feb 07, 2025

4 Min Read

Self-Consistency for Chain-Of-Thought prompting

Published Feb 12, 2025

4 Min Read

Generative AI trends to watch in 2025 & beyond

Published Feb 14, 2025

4 Min Read

Data delivery to LLMs

Published Feb 16, 2025

4 Min Read

Large language model programs

Published Feb 18, 2025

4 Min Read

What is multi-task language understanding (MMLU)

Published Feb 20, 2025

4 Min Read

Random CoT for better LLM reasoning

Published Feb 25, 2025

4 Min Read

Large language model hallucination mitigation techniques

Published Mar 03, 2025

4 Min Read

Validating low-confidence LLM generation

Published Mar 14, 2025

4 Min Read

Considering large language model reasoning step length

Published Mar 12, 2025

4 Min Read

Chain Of Natural Language Inference (CoNLI)

Published Mar 17, 2025

4 Min Read

Meta taxonomy of LLM correction & refinement

Published Mar 24, 2025

4 Min Read

Understanding LLM user experience & expectation

Published Mar 27, 2025

4 Min Read

Concise Chain-of-Thought (CCoT) prompting

Published Mar 31, 2025

4 Min Read

UniMS-RAG: Unified multi-source RAG for personalised dialogue

Published Mar 28, 2025

4 Min Read

Chain-of-Symbol prompting (CoS) for LLMs

Published Apr 07, 2025

4 Min Read

A benchmark for verifying chain-of-thought

Published Apr 02, 2025

4 Min Read

Seven RAG engineering failure points

Published Apr 08, 2025

4 Min Read

Designing conversational UIs that match user intent

Published Apr 09, 2025

4 Min Read

Comparing human, LLM & LLM-RAG responses

Published Apr 10, 2025

4 Min Read

Beyond Chain-of-Thought LLM reasoning

Published Apr 14, 2025

4 Min Read

T-RAG = RAG + fine-tuning + entity detection

Published Apr 17, 2025

4 Min Read

Demonstrate, search, predict (DSP) for LLMs

Published Apr 23, 2025

4 Min Read
