One of the UK’s leading retail and commercial banks, serving millions across personal and business banking. The bank manages thousands of daily chatbot and phone interactions, generating large volumes of unstructured data that was previously underutilised due to scale, quality issues, and reliance on vendor-led analysis.
Build an in-house NLP capability to extract actionable insights from customer interactions thus improving service quality, chatbot responses, first-call resolution, and agent performance benchmarking.
Chat and telephonic transcripts were ingested from the client’s S3 environment. Invalid and low-quality records were filtered, and programmatic labelling was performed using semantic matching, custom scenarios, and business-rule logic.
Transcripts were standardised using part-of-speech tagging, lemmatisation, named entity recognition for entity removal, and stop-word elimination to reduce noise.
Text was converted into word and sentence embeddings. High-dimensional vectors were reduced using UMAP, and bi-grams were extracted to create structured model inputs.
Intent modelling evaluated LDA and BERTopic, with BERTopic selected for interpretable clustering and active-learning feedback. Call resolution used semi-supervised fastText models, refined iteratively through SME-guided active learning.
Outputs were validated using probability scores and SME review, with intent aligned to chatbot categories and resolution assessed using precision, recall, and F1-score.
The solution produced intent outputs covering primary and secondary reasons for calling with supporting sentences and keywords, along with call-resolution outcomes with confidence scores and contextual evidence.