Remix Hacker News Clone

170

Muse Spark: Scaling towards personal superintelligence

I don't get the comments trashing this. If it slightly beats or even matches Opus 4.6, it means Meta is capable of building a model competitive with the leading AI company. Sure, they spent a lot of money and will have on-going costs. But how much more work would it take to turn that into a coding agent people are willing to try (and pay for) along side their usage of a collection of agents (Claude, Codex, etc)? Also means Meta doesn't have to pay another company to use a SATA model across all their products (including IG and WhatsApp, vr) which will matter to their balance sheet long term (despite the constant r&d spend).

by tty4561775668416

Comes impressively close to GPT 5.4 / Gemini 3.1 Pro / Opus 4.6! Mostly behind OpenAI on coding/agentic benchmarks, behind Google on text reasoning, behind Anthropic on Humanity's Last Exam with tools (surprisingly the only benchmark where Anthropic leads currently).

Meta hasn’t fully caught up, but they came close and I think can solidly claim to be a frontier lab again. I’d call it a 3.5 horse race right now, and hopefully their next model improves. More model competition is good!

Poor Grok 4.2 should probably be dropped from the table.

by granzymes1775679833

This really reinforces the idea that the AI race and the Railroad Mania of the 19th century are very similar.

So many different companies are going to have similarly powerful ai that there will be no moat around it and it will be cheap. They will never earn their investment back.

by daft_pink1775666960

Ran some of my internal benchmarks against this and I'm very unimpressed. I don't think this moves them into the OAI v Anthropic v Gemini conversation at all.

Major analytical errors in their response to multiple of my technical questions.

by creddit1775667033

Personal as in Meta gets your personal data so they can sell you more ads.

by glerk1775671001

The hero image on the linked page, which consists of a muted teal background with the words "Introducing Muse Spark", weighs in at 3,5MB. I don't even...

by hackrmn1775668536

Meta is in a weird spot. They caught up late to the game and instead of releasing llama as a chat bot they open sourced it, precisely because they lost the mind share. They thought chatbot is not their product and I am sure they are regretting it now. Mark is obsessed with becoming the android of something and he poured billions into the metaverse thinking he is first and failed. He then open sourced llama and wanted to be the android of llms. He ended up enabling groq but it didn’t benefit meta directly at all. They have no revenue or mind share path from llms but continue to pour billions into it. The only 1-1 mapping is with the glasses but that is a tough fit for the company given they are extremely allergic to privqcy and security.

Not sure what this is now.

by yalogin1775673244

Litmus test: what % of meta engineers are using muse vs Claude code? Last i heard it was mostly claude code. Tell you everything you need to know about how serious these benchmarks are.

by dbgrman1775679302

Uploading images requires logging in. Logging in is broken. It redirects to https://meta.ai/?error=Token%20exchange%20failed and doesn't show any error message. Impressive.

by spearman1775678714

"Muse Spark is available now, and Contemplating mode will be rolling out gradually in meta.ai."

How does one get their hands on these models? They are not open-source, right? I go to meta.ai, but it's just a chat interface---no equivalent to codex or claud code? Can you use this through OpenCode? Is meta charging for model access, or is the gathering of chat data a sufficiently large tithe?

by moab1775664260

Benchmarks are meaningless until the pelican benchmark comes out: https://simonwillison.net/

by btown1775678943

We all know it... but I think they were very bold in this warning about using your private messages to train public models. _Your messages with AIs will be used to improve AI at Meta. Don't share information, including sensitive topics, about others or yourself that you don't want the AI to retain and use_

by bguberfain1775669086

How is that Meta spent so much money for talent and hardware, but the model barely matches Opus 4.6?

Especially, looking at these numbers after Claude Mythos, feels like either Anthropic has some secret sauce, or everyone else is dumber compared to the talent Anthropic has

by throwaw121775666636

Sounds like a good effort. They are choosing to focus on multi-modality - perhaps they are taking a different route here to Anthropic.

I don't like that I need to login to my FB/Instagram account to access this.

by spprashant1775678585

Genuine question: Why release this the day after Mythos? It does not appear SOTA (just based on benchmarks). OpenAI will likely release Spud tomorrow.

by hvass1775672369

The second paragraph starts "Muse Spark is the first step on our scaling ladder and the first product of a ground-up overhaul of our AI efforts. To support further scaling, we are making strategic investments..."

This article is about Meta, not about the user. Who signs off on these? Is the intended audience other people at Meta, not the user?

by ddp261775665617

Question: since they've rebooted their approach to AI... have they given up on open models? There's no mention of open source or open weights or access to the models beyond their hosted services.

by toddmorey1775666376

Looks like it needs a meta account? As soon you hit enter it wants to log-in. I guess I won't try this any time soon. :)

by binaryturtle1775670598

This would have been an amazing release 6 months ago. But the industry moves so fast, this is a trite release. Maybe it’s best for Meta to sell their superintelligence division. I don’t think Zuck’s vision is particularly compelling.

by gallerdude1775665866

I would like someone to tell me how stupid I am. If I were Meta/Zuck I'd open source a great model the moment my company developed it. This just looks like a pitch to investors, otherwise.

by gritspants1775669630

What is the "BioTIER-refuse" thing mentioned in the "Bioweapons Refusal" graph?

I Googled it and found absolutely nothing.

Well, to be honest, I got 100% of websites containing the French word "boîtier" (box) with a typo.

Even on Google Scholar, the closest match is "BioTiER (Biological Training in Education and Research) Scholars Program", which is at least 10 years old and has nothing to do with that.

Is that an AI-generated image with an AI-generated name that has no physical existence?

by edwcross1775670394

Will experiment with the model. But I am scared of sharing any information with the Zuck ecosystem.

by sidcool1775666091

https://meta.ai/share/pe4HxOfv2Bp

Finding a little bit tricky to evaluate because the harness is unfortunately very, very bad (e.g. search is awful). Can't wait to try this in some real external services where we can see how it performs for real.

Definitely getting ordinary high-quality results, overall. But hard to test agentic behavior and hard to test prose quality, even, when just working off of the default chat interface.

One thing that stands out is that _for_ the quality it feels very, very fast. Perhaps it's just only very lightly loaded right now, but irrespective it's lovely to feel.

I'm quite impressed with the tone overall. It definitely feels much more like Opus than it does, like, GPT or Grok in the sense that the style is conversational, natural and enjoyable.

by tekacs1775670884

Sarcasm aside, tried it (with instant mode), it's an impressive model.

It nailed all the ChatGPT meme gotchas (walk to the carwash, Alice 50 brothers, upside down cup, R's in strawberry, which number is bigger, 9.11 or 9.9?)

I guess all that money poaching OpenAI / Anthropic talent went somewhere...

Now, would I use "Meta Muse Code" or "Muse CoWork" if I have to have a facebook account to all of my developers? Maybe not.

Would I use it via an API key? I might, depends on the pricing!

by eranation1775673126

Can't login. No error message in the UI. But the URL changes to "https://www.meta.ai/?error=Token%20exchange%20failed".

by cvhc1775675417

So this is why Anthropic rushed the weirdest "pre-responsible-disclosure-totally-not-for-marketing" announcement yesterday? To make sure Spark doesn't steal their thunder? (Spark beats Opus 4.6 on some benchmarks...). Or did I become a bitter cynical old man.

by eranation1775668715

It is unfortunate that they decided to stop doing open-weight releases.

What could have been interesting has been reduced to simply another subpar LLM release.

by GalaxyNova1775672935

"we hope to open-source future versions of the model."

Love to see it. Cheers!

by khurdula1775670741

Saying nothing about the actual performance of this model, it does strike me how .... minimal(?) this announcement is. Their safety section is like 2 paragraphs about bioweapons. Go look at the reports for OpenAI and Anthropic's model releases. It's like 50+ pages of tests, examples, reports, and benchmarks across a bunch of safety and wellfare metrics.

If Meta wants to be seen as a cutting edge massive lab they need to come across as one instead of looking like a school project version of a frontier model.

by nharada1775670494

> Muse Spark is available today at meta.ai and the Meta AI app. We’re opening a private API preview to select users.

by zurfer1775665709

Kinda crazy, it really felt like Meta had the lead in LLMs, especially during the early LLaMa days. What happened for them to fall so far behind? I don’t get how LLaMa 4 was such a big train wreck and they couldn’t correct the course like Google.

by syntaxing1775672742

Personal Superintelligence made me think this was an open-source model being released and I was excited. Then I continued reading and I'll just wait until the model comes out.

by chankstein381775667170

Looks like a lightweight article. But memory usage went from 316MB -> 502 MB when I hit refresh. Not sure why? Any one have any ideas? Why does it need half a gig of ram in the first place?

by plombe1775672907

so glad its beating all the others on bioweapons refusal. this is what i most wanted out of the latest SOTA model

by oliver2361775667036

Oh good, if they built a lab, I’m sure they took the time the precisely define what they mean by super intelligence? Right? …

by khalic1775665808

I'm cautiously waiting for the feedback from the first users. Meta has produced a lot of great models (LLama), maybe this is a comeback... but I'm cautious, as the jump in the quality is almost too high.

Also, I think people aren't used that using such models requires meta.ai or meta ai app.

by Artgor1775665783

https://meta.ai/ this is where you can try it seems like the API is not publicly accessable yet. I feel they are very late to the game and do not show value to customers over other models.

by visioninmyblood1775666604

by 1775668952

I can't login. It sends me always the same code and it's not correct for them

by napolux1775671426

This looks like a very interesting model and very promising, especially after llama lost so much ground recently. I hope they release the weights

by santiagobasulto1775666475

So Meta is not releasing open source models anymore?

by chrsw1775666014

Sad to see it's not going to be open source.

by dhruvyads1775673089

I'm struck by all these independent announcements saying "look at our new model that we only spent $N Billion in acquisitions and hardware time to build and operate that's just like those other ones but this one is ours." Because if any of these companies would simply pool resources and work together, and if the government actively participated in providing funds, they'd be able to accelerate AI so much faster. It all feels incredibly wasteful. But I guess that's communism or something.

by BugsJustFindMe1775673634

I have to create meta account to access. No thanks.

by vinni21775669417

NOTHING about this is personal! No weights were released!

by nubg1775672102

Hoping the benchmarks are correct this time...

by warthog1775665525

So does this confirm the end of llama?

by ComputerGuru1775667240

Their product could literally teleport gold into my hands and I wouldn't use it.

by nathan_compton1775674969

Associated Meta news post with consumer-friendly takes: https://about.fb.com/news/2026/04/introducing-muse-spark-met...

by ChrisArchitect1775667443

did they just copy the chatgpt ui?

by jansport1231775668301

> Meta AI isn't available yet in your country

Not my loss, will keep using DeepSeek then. Wake me up when my country is no longer in the wrong/right side of history.

by Kuyawa1775669174

Until you actually try the model itself, assume any benchmark presented to you as being part of the marketing material of the model, as it is not independently verified and completely biased.

The same is true with any other model, unless otherwise stated.

In the next few days, we'll see who Meta has paid to promote this model on social media.

by rvz1775666847

The only benchmark they show against SOTA models is in bioweapons refusal.

Edit: nvm I can't read, regular benchmarks against SOTA are there

by OsrsNeedsf2P1775666472

Meta.ai has muse spark

by sidcool1775668362

funny how websites do that thing where it looks like you can use the product but soon as you hit enter, nope login first

by ge961775670242

[dead]

by federicodeponte1775677213

[dead]

by alyxya1775665966

[dead]

by aivillage_team1775672817

I can remember when AOL was an unstoppable giant. Except it wasn't. People eventually realized they could get a better, cheaper, faster experience with ISPs and search engines. The same path is unfolding before Meta. People have much better options, and plethora of Meta users will slowly leave until the big moat is drained. Zuck, go retire to your NZ bunker before Meta is forced to merge with another media company.

by 1970-01-011775668425

How's the metaverse doing? It was the next big thing and how we're all going to be working inside it in... was it like 3 months ago?

Maybe they need to mine more libra coin first? or is it diem now? is that even still part of meta?

I'm sure this new AI is super intelligent and super awesome and will be writing all the code, making all the blog posts, and generating all our youtube shorts in 6 months.

by ehutch791775667635