TY - JOUR
T1 - A survey of the state of the art in conversational question answering systems
AU - Perera, Manoj Madushanka
AU - Mahmood, Adnan
AU - Wijethilake, Kasun Eranda
AU - Islam, Fahmida
AU - Tahermazandarani, Maryam
AU - Sheng, Quan Z.
PY - 2026/12
Y1 - 2026/12
N2 - Conversational question answering (ConvQA) systems have emerged as a pivotal area within natural language processing (NLP) by driving advancements that enable machines to engage in dynamic and context-aware conversations. These capabilities are increasingly being applied across various domains, i.e., customer support, education, legal, and healthcare where maintaining a coherent and relevant conversation is essential. Building on recent advancements, this survey provides a comprehensive analysis of the state of the art in ConvQA. This survey begins by examining the core components of ConvQA systems, i.e., history selection, question understanding, and answer prediction, highlighting their interplay in ensuring coherence and relevance in multi-turn conversations. It further investigates the use of advanced machine learning techniques, including but not limited to, reinforcement learning, contrastive learning, and transfer learning to improve ConvQA accuracy and efficiency. The pivotal role of large language models, i.e., RoBERTa, GPT-4, Gemini 2.0 Flash, Mistral 7B, and LLaMA 3, is also explored, thereby showcasing their impact through data scalability and architectural advancements. Additionally, this survey presents a comprehensive analysis of key ConvQA datasets and concludes by outlining open research directions. Overall, this work offers a comprehensive overview of the ConvQA landscape and provides valuable insights to guide future advancements in the field.
AB - Conversational question answering (ConvQA) systems have emerged as a pivotal area within natural language processing (NLP) by driving advancements that enable machines to engage in dynamic and context-aware conversations. These capabilities are increasingly being applied across various domains, i.e., customer support, education, legal, and healthcare where maintaining a coherent and relevant conversation is essential. Building on recent advancements, this survey provides a comprehensive analysis of the state of the art in ConvQA. This survey begins by examining the core components of ConvQA systems, i.e., history selection, question understanding, and answer prediction, highlighting their interplay in ensuring coherence and relevance in multi-turn conversations. It further investigates the use of advanced machine learning techniques, including but not limited to, reinforcement learning, contrastive learning, and transfer learning to improve ConvQA accuracy and efficiency. The pivotal role of large language models, i.e., RoBERTa, GPT-4, Gemini 2.0 Flash, Mistral 7B, and LLaMA 3, is also explored, thereby showcasing their impact through data scalability and architectural advancements. Additionally, this survey presents a comprehensive analysis of key ConvQA datasets and concludes by outlining open research directions. Overall, this work offers a comprehensive overview of the ConvQA landscape and provides valuable insights to guide future advancements in the field.
KW - Conversational AI
KW - Conversational history management
KW - Large language models
KW - Machine learning
KW - Natural language processing
UR - http://www.scopus.com/inward/record.url?scp=105026452038&partnerID=8YFLogxK
U2 - 10.1007/s10115-025-02620-1
DO - 10.1007/s10115-025-02620-1
M3 - Review article
AN - SCOPUS:105026452038
SN - 0219-1377
VL - 68
SP - 1
EP - 50
JO - Knowledge and Information Systems
JF - Knowledge and Information Systems
IS - 1
M1 - 31
ER -