Clearing up confusion: GPT 3.5-Turbo may not be 20b after all
Cross-posting this from Reddit So one thing that had really bothered me was that recent Arxiv paper claiming that despite GPT 3 being 175B, and GPT 4 being around 1.7T, somehow 3.5 Turbo was 20b. This had been on my mind for the past couple of days because