Has new updates to my Natural Language Bot lead to more Customer Retention and Satisfaction?

yasha kaushal
Feb 5, 2024
2 min read

Keywords - A/B testing, Hypothesis testing, NLP, Sentiment Analysis, Power analysis, Transformers, Linear Discriminant Analysis (LDA), ChatGPT, NLTK, Python, Pandas, Matplotlib

Natural Language Bots are becoming exponentially popular in enhancing customer experiences and improving business revenues, especially when integrated on websites compared to traditional bots .

However, more quantitative studies are needed to understand the satisfaction and pain-points of customer interactions. We analyze conversations with 2 versions of personalized chatbot (ChatGPT taken for demonstration purposes) across the globe in various languages and statistically test if upgraded version is leading to better customer satisfaction or not.

The chatbot market is expected to grow from ~1 billion dollar to ~25 billion dollars in a 10 year span from 2020 - 2030

Business Questions

Identify ways of quantifying customer experience via personalized chat bot interaction.
Is staying on top of technology integration with current platforms actually increasing business revenues?
What are the pain-points in this exchange for future customizations and updates in the bot - drop-off topics, customer retention rate etc?
Are new customizations leading to an expected positive impact (A/B testing)?
Measure long-term impact on business ROI.

In this project -

We perform sentiment analysis on A-data and B-data to get the % of conversations with high-confidence positive user-experience (sentiment) before and after the bot upgrade.
We perform Power Analysis to get minimum number of samples required from both the datasets to have statistically significant results
We find that there is 2.5 % increase (57.8 % to 60.2 %) in % of high-confidence positive sentiment conversations.
This is statistically significant and as expected !
We also create WordCloud of high-confidence negative sentiment conversations and find that majority of the topics were related to programming, code, data and modeling.

Data

Real world use case would be a company's webpage with Natural Language Bot assisting in purchases. This interaction decides if a customer ends up buying a product or not (conversion rate).

We will be using ChatGPT-3.5 and ChatGPT-4 conversation data for demonstration purposes. We have conversation exchanges with ChatGPT from users across the globe collected over a period of -

6 months (Nov'22 - Apr'23, hereafter called A-data) - 52,000 conversations
3 months (Apr'23 - Jun'23, hereafter called B-data) - 40,000 conversations

This data is retrieved from the shareGPT platform in json format with 3 columns -

'id', - unique id of a conversation exchange
'from' - 'human' or 'gpt'
'value' - natural language text of the interaction

This study goes one step further on recently published work on this dataset (10th Dec 2023) - Early ChatGPT User Portrait through the Lens of Data.

Authors found GPT to be POSITIVE (>80%), NEUTRAL (8%) and NEGATIVE (12%) even when given with negative prompts, suggesting overall positive tone of this Natural Language Bot.

All the datasets used in the notebook could be found here - https://pitt-my.sharepoint.com/:f:/g/personal/yak39_pitt_edu/EvsVKJeRcnRAm82vqlrlrs8Bpj9ifBUwdzTAn0J0AVRMMw?e=wyT4Er

Has new updates to my Natural Language Bot lead to more Customer Retention and Satisfaction?

Business Questions

In this project -

Data

Recent Posts

Comments