When a new housing society emerges, residents inevitably create chat groups to connect and share information, using chat apps like WhatsApp and Telegram. In India, Telegram seems to be the favorite as it provides generous group limits and admin tools, among other features. These virtual communities become treasure troves of invaluable insights. But whatever app you use, finding the right information at the right time is always a problem. Sure, the apps have a "Search" button, but it is pretty much limited to keyword search and is of little use when you have to dig through thousands of messages.
I found myself in this situation when it was my turn to start an interior design project for my home. Despite being part of a vibrant Telegram group, where countless residents had shared their experiences with various interior designers and companies, I struggled to unearth the pearls of wisdom buried within the chat's depths. Then I remembered that I could take advantage of AI, particularly LLMs, to sift through this data.
Telegram has a nifty feature where you can request a data dump of a group that you are part of. The data is generally available 24 hours after the request. Since my group had more than 50,000 messages, the dump was split across multiple HTML files containing the user chats.
[Image: Telegram Group Data Dump]
I needed to extract this into a simple text document that I could feed to an LLM.
I wrote a simple C# program to do this:
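The same extraction idea can also be sketched in Python. This is not the original C# program, only a minimal illustration. It assumes each message body in the exported HTML sits inside a div whose class list includes "text", which is how the export looked at the time of writing but may change. The names TelegramTextExtractor and extract_messages are made up for this sketch.

```python
from html.parser import HTMLParser


class TelegramTextExtractor(HTMLParser):
    """Collects the text of every <div class="text"> element (assumed layout)."""

    def __init__(self):
        super().__init__()
        self.messages = []
        self._depth = 0    # div nesting depth inside the current "text" div
        self._parts = []   # text fragments of the message being read

    def handle_starttag(self, tag, attrs):
        if tag == "br" and self._depth:
            self._parts.append(" ")  # treat line breaks as spaces
        if tag != "div":
            return
        if self._depth:
            self._depth += 1  # nested div inside a message body
        elif "text" in (dict(attrs).get("class") or "").split():
            self._depth = 1   # entering a message body
            self._parts = []

    def handle_endtag(self, tag):
        if tag == "div" and self._depth:
            self._depth -= 1
            if self._depth == 0:
                # Collapse runs of whitespace and keep non-empty messages only.
                text = " ".join("".join(self._parts).split())
                if text:
                    self.messages.append(text)

    def handle_data(self, data):
        if self._depth:
            self._parts.append(data)


def extract_messages(html):
    """Return the plain-text messages found in one exported HTML page."""
    parser = TelegramTextExtractor()
    parser.feed(html)
    parser.close()
    return parser.messages
```

Running extract_messages over the contents of each exported HTML file and writing the results one per line gives a plain text file that is easy to filter and batch.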
Finally, I could narrow it down to ~1,300 messages. Next, I split these into batches of 200 messages each to stay within the context token limit of Anthropic's Claude Sonnet LLM, and used Amazon Bedrock to extract the information from each batch.
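The batching step is easy to picture in code. Here is a minimal Python sketch, with the batch size of 200 taken from above and everything else assumed: the function names are hypothetical, and the prompt wording is only a guess at the kind of instruction that would work, not the prompt I actually used. The call to Amazon Bedrock is left out on purpose.

```python
def split_into_batches(messages, batch_size=200):
    """Split a list of messages into consecutive batches of at most batch_size."""
    if batch_size <= 0:
        raise ValueError("batch_size must be positive")
    return [messages[i:i + batch_size] for i in range(0, len(messages), batch_size)]


def build_batch_prompt(batch):
    """Build one prompt for one batch (illustrative wording, not the original prompt)."""
    instruction = (
        "Below are chat messages from a housing society group. "
        "List the interior companies or persons who received positive feedback, "
        "with phone numbers where mentioned."
    )
    return instruction + "\n\n" + "\n".join(batch)
```

With roughly 1,300 messages, this gives six full batches of 200 and a seventh, smaller one, each small enough to send as a single request.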
I now had a neat summary of information for each batch:
Based on the chat messages, here are the names and phone numbers of interior companies or persons who received positive feedback for good quality work delivered on time:
1. Redacted - One user mentioned finalizing interiors with them.
2. Redacted - A user had a pleasant experience with them for interiors at Redacted.
3. Redacted - A user's friend got good interiors done by them at Redacted.
4. Redacted - Recommended by a user for their great work on a villa.
5. Redacted - Quoted around 17L for a 2.5 BHK at Redacted, and one user liked their work.
6. Redacted - Did the model flat interiors at Redacted. A user mentioned they are open to discounts for bulk orders.
Some other companies mentioned without specific feedback were Redacted, Redacted and Redacted. A few users cautioned against Redacted based on poor experiences at Redacted.
The results were nothing short of remarkable. I received a neatly organized summary, complete with names, phone numbers, and feedback on interior companies that had garnered praise for their quality work and timely delivery. Unexpectedly, the LLM even went the extra mile, cautioning against companies with poor track records, ensuring I had a well-rounded perspective. Now I had 7 sets of responses, which I fed to the LLM once more to generate a neat list of good and bad companies!
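That second pass, where the per-batch summaries go back to the LLM, can be sketched in Python as well. The sketch takes the model call as a parameter named ask_llm, a placeholder for whatever sends a prompt to the model and returns its reply, so nothing here is tied to a specific SDK. The function name and the prompt wording are hypothetical too.

```python
from typing import Callable, List


def merge_batch_summaries(summaries: List[str], ask_llm: Callable[[str], str]) -> str:
    """Combine per-batch summaries into one prompt and ask the model for a final list."""
    numbered = "\n\n".join(
        f"Summary {i}:\n{summary.strip()}" for i, summary in enumerate(summaries, 1)
    )
    prompt = (
        "The following summaries were produced from separate batches of the same chat. "
        "Merge them into one list of recommended companies and one list of companies "
        "to avoid, removing duplicates.\n\n" + numbered
    )
    return ask_llm(prompt)
```

Passing the model call in as a function also makes it easy to try the merge step with a stub before spending any tokens.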
Was using an LLM overkill for this task? Perhaps. Was it fun? You bet!