Working with LLMs
I’ve followed Rui Carmo for more years than I’d care to remember. I laughed out loud at his latest post trying to get a tool which gives GPT a harness to use web search and scraping.
“I got it running inside a sandbox and asked it to summarize the top ten Hacker News posts...The thing is, NONE OF THOSE ARTICLES ACTUALLY EXISTED. The HN home page looked nothing like it….Somehow gpt-3.5-turbo decided to hallucinate them instead of using the web scraper, which just goes to show you how stupefyingly fallible these things are right now.”
That’s the key thing to remember with these models. It wants to accomplish the goal and how it does that, well that’s where hallucinations can crop up.
Large, diverse and dispersed data summaries would be fantastic (I still read RSS feeds). I’m not tackling it yet though.
(This is a good read about LLMs as well)