•

Nvidia released a paper about a 100KB text-to-image model that was trained for only 4 minutes, yet they claim it beats bigger models

Key-Locked Rank One Editing for Text-to-Image Personalization

https://research.nvidia.com/labs/par/Perfusion/

They also claim that it only takes about 8 seconds to generate various good images.

•

Might want to clarify: The "model" in this case is not a full model like Stable Diffusion, but rather something used like a patch, more comparable to something like LoRA

I don't think that anyone would misunderstand anyway, but better safe than sorry
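To make the "patch" idea a bit more concrete, here's a rough sketch. This is not Perfusion's actual method, just the generic low-rank idea in the spirit of LoRA: leave the big weight matrix frozen and store only a small update next to it. The layer dimensions and function names are made up for illustration.

```python
# Generic low-rank "patch" sketch. NOT the Perfusion algorithm.
# All names and dimensions here are hypothetical.

def full_matrix_params(d_out, d_in):
    """Number of values in a full d_out x d_in weight matrix."""
    return d_out * d_in


def low_rank_patch_params(d_out, d_in, rank):
    """Number of values in a rank-`rank` update stored as two thin factors."""
    return rank * (d_out + d_in)


def apply_rank_one_patch(weight, u, v):
    """Return weight + outer(u, v) as a new matrix; the original is left untouched."""
    return [
        [w + u_i * v_j for w, v_j in zip(row, v)]
        for row, u_i in zip(weight, u)
    ]


# Hypothetical layer size, chosen only to show the scale difference.
d_out, d_in = 320, 768
full_size = full_matrix_params(d_out, d_in)
patch_size = low_rank_patch_params(d_out, d_in, rank=1)
print(f"full layer: {full_size} values, rank-one patch: {patch_size} values")

# Tiny worked example: a 2x3 "base" weight patched by a rank-one update.
base = [[1, 1, 1], [1, 1, 1]]
patched = apply_rank_one_patch(base, u=[1, 2], v=[1, 0, -1])
print(patched)
```

The point is only the scale: the patch stores a tiny fraction of the values of the layer it modifies, which is why a file like this can be kilobytes while the base model it depends on is gigabytes.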

astrsk

•

That’s the real meat of this. The future of models will be these smaller, focused “patches” that have some kind of traceable lineage. At least when it comes to marketing and selling these.

hoshikarakitaridia

•

I'm always sceptical about those claims.

Let them prove it, and then we can decide if it's good or not, instead of getting our hopes up for empty promises.

It's not the first time people have made outlandish claims about AI, though of course you'd expect someone like Nvidia to be cognisant of this kind of marketing.

zalack

•

NVIDIA's marketing overhypes, but their technical papers tend to be very solid. Obviously it always pays to remain skeptical, but they have a good track record here.

JackGreenEarth

•

Release it, and let us see. Don't just claim stuff.

ghariksforge

•

Where can we download the model?

AngrilyEatingMuffins

•

Can this be adapted to LLMs?
