- Homescreen
- Posts
- š Arena Ambush
š Arena Ambush
The anonymous-chatbot
GM! Itās Brett again. If you're a founder putting in long hours but fueling your body with crap, you're gonna have a very bad time. Trust me, I've been that zombie founder. Now I've hacked my diet: I eat the same thing for lunch and dinner every day.
BROUGHT TO YOU BY:

š AI
Leaderboard Leap

Recently a mystery AI model popped up in the LMSYS Chatbot Arena and itās based on GPT-4 architecture.
New model on arena again. It is simply named 'anonymous-chatbot' this time. There is speculation already that this is Q*. Whatever it is, something new is on the way.
ā Andrew Curran (@AndrewCurran_)
5:13 AM ⢠Aug 7, 2024
Early testers are saying it's leaving GPT-4o in the dust when it comes to reasoning skills.
Anonymous chatbot was able to solve all my test puzzles on the first attempt.
And the strawberry test!
ā FLOWERS (@flowersslop)
5:25 AM ⢠Aug 7, 2024
It totally out performed GPT 4o for me on a test, and my own prompt, not something that could be memorized.
ā AI Machine Dream (@AIMachineDream)
6:59 AM ⢠Aug 7, 2024
Now, you know how this goes. The rumor mill started churning instantly. Is this OpenAI's top-secret Q* model? Or maybe its evolution, "Project Strawberry"? Adding fuel to the fire, Sam tweeted this -
i love summer in the garden
ā Sam Altman (@sama)
3:29 PM ⢠Aug 7, 2024
The thing is that OpenAI's pulled this stealth testing stunt before. They tested GPT-4o with gpt2-chatbot 2 weeks before the public release.
But that's not the only action in the Arena. Google's been making waves too. Their new Gemini 1.5 Pro (Experimental 0801) just crushed it after a week of community testing. We're talking over 12K votes here.
Exciting News from Chatbot Arena!
@GoogleDeepMind's new Gemini 1.5 Pro (Experimental 0801) has been tested in Arena for the past week, gathering over 12K community votes.
For the first time, Google Gemini has claimed the #1 spot, surpassing GPT-4o/Claude-3.5 with an impressive⦠x.com/i/web/status/1ā¦
ā lmsys.org (@lmsysorg)
4:33 PM ⢠Aug 1, 2024
For the first time ever, Gemini snagged the #1 spot, beating out GPT-4o and Claude-3.5 with a crazy high score of 1300. It's even topping the Vision Leaderboard.
The big picture: The AI race is red hot. We've got mystery models, stealth releases, and established players leapfrogging each other. This isn't just about bragging rights or leaderboard positions. These advancements could redefine what's possible in natural language processing, multimodal understanding, and reasoning capabilities.
The tools at our disposal are about to get a lot more powerful, and the competitive landscape a whole lot more interesting

Everyone tells you to learn AI but no one tells you where.
We have partnered with Growthschool to bring this ChatGTP & AI Workshop to our readers. It is usually $199, but free for you because you are our loyal readers š
This workshop has been taken by 1 Million people across the globe, who have been able to:
Build business that make $10,000 by just using AI tools
Make quick & smarter decisions using AI-led data insights
Write emails, content & more in seconds using AI
Solve complex problems, research 10x faster & save 16 hours every week
Youāll wish you knew about this FREE AI Training sooner (Btw, itās rated at 9.8/10 ā)

š¤ THE LATEST INā¦
TECH
Intelās cutting over 15% of its workforce.
Disneyās streaming biz announced itās first-ever profitable quarter.
Googleās hit with a huge antitrust loss over its search monopoly.
DeepMind's robot is now smashing table tennis like a pro.
AI
Google got caught running ads for AI apps that create non-consensual deepfakes.
Palantir and Microsoft are teaming up to supercharge AI for US defense and intelligence.
Zuck says Meta needs 10x more compute power to train Llama 4 than Llama 3.
Figureās new humanoid robot, powered by OpenAI, chats naturally on the factory floor.
SCIENCE
AI just nailed 3D drug targets, speeding up the hunt for mental health treatments.
Stanfordās psychologist says his AI can read your mind just by scanning your face.
Scientists just found bacteria with hidden genes floating around.
Heat waves are soaring, and so are A/Cās carbon emissions.
CRYPTO
Rippleās top lawyer spills the beans on their game plan after getting hit with a $125M SEC fine.
Crypto's riding high with $176M inflows last week, and etherās leading the charge as sentiment stays bullish.
Coinbase slams the CFTC's new rule that could kill off many prediction markets.

šāāļø QUICKIES
Raise: Anduril, a defense tech startup, raised $1.5B in a funding round, bumping its valuation up to $14B, to ramp up their new Arsenal factory.
Stat: $9B. Thatās the value Warner Bros. Discovery wrote down for its TV assets thanks to falling cable fees and sports rights troubles.
Rabbit Hole: What a century (plus a pandemic) does to moviegoing and why it matters (Matthew Ball)

𤩠MONDAY MOTIVATION

š ļø FOUNDERS CORNER
The best resources we recently came across that will help you become a better founder, builder, or investor.
šÆ Feedefy makes collecting user feedback easy
šØ Navier AI makes CFD simulations 1000x faster
š§ BrainBird is the image editor youāve been looking for

āļø AI GENERATED OR NOT

Is this AI generated? |

HOW WAS TODAY'S NEWSLETTER? |

REACH 40K+ FOUNDERS, INVESTORS & OPERATORS
If youāre interested in advertising with us, send an email over to [email protected] with the subject āHomescreen Adsā.er