stupid_man_costume

they need to specify stable diffusion 3 large vs medium models. the one they released is.. ya


pxp121kr

they do: when you select the right or left image, the model name shows up at the top. A few times I got Stable Diffusion 3 Medium and sometimes just Stable Diffusion 3 (maybe this is the large?). I think they haven't collected enough results yet for Stable Diffusion 3 Medium to show up on the leaderboard.


stupid_man_costume

i meant on the leaderboard


Charuru

For stable diffusion they need more finetunes… the reason why people prefer it sometimes is because the finetunes are so strong


Fastizio

There is a text-to-speech leaderboard as well from LMSYS. ElevenLabs is destroying the competition, as expected.


AdAnnual5736

Today I learned there’s such a thing as DALL-E 3 HD. How does someone access that?


Shandilized

Only through the API; the app does not offer it. It's twice as taxing on the servers, so they won't include it in the ChatGPT Plus subscription. You can specify the quality when calling the API. Just change the quality to hd instead of standard, like so:

    from openai import OpenAI

    # The client reads the API key from the OPENAI_API_KEY environment variable
    client = OpenAI()

    response = client.images.generate(
        model="dall-e-3",
        prompt="a white siamese cat",
        size="1024x1024",
        quality="hd",  # "standard" is the default
        n=1,
    )

    # URL of the generated image
    image_url = response.data[0].url

Be aware that the price is doubled: 8 cents an image instead of 4 cents. It quickly adds up.
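To see how quickly it adds up, here is a rough sketch of the arithmetic. It assumes the per-image prices quoted above (4 cents standard, 8 cents hd, for 1024x1024), which may have changed since. `PRICE_CENTS` and `estimate_cost` are made-up names for illustration, not part of the OpenAI SDK.

```python
# Per-image prices in cents for 1024x1024 DALL-E 3 images, as quoted in the
# comment above. Assumption: check current pricing before relying on these.
PRICE_CENTS = {"standard": 4, "hd": 8}


def estimate_cost(n_images: int, quality: str = "standard") -> float:
    """Return the estimated cost in dollars for n_images at the given quality."""
    if quality not in PRICE_CENTS:
        raise ValueError(f"unknown quality: {quality!r}")
    if n_images < 0:
        raise ValueError("n_images must be non-negative")
    # Work in whole cents first, then convert to dollars
    return n_images * PRICE_CENTS[quality] / 100


if __name__ == "__main__":
    for quality in PRICE_CENTS:
        print(f"250 images at {quality}: ${estimate_cost(250, quality):.2f}")
```

Keeping the prices in one dictionary means a pricing change is a one-line edit.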


AdAnnual5736

Thanks!


Shandilized

You're welcome! :)


bearbarebere

!remindme 2 days


RemindMeBot

I will be messaging you in 2 days on [**2024-06-15 00:26:04 UTC**](http://www.wolframalpha.com/input/?i=2024-06-15%2000:26:04%20UTC%20To%20Local%20Time) to remind you of [**this link**](https://www.reddit.com/r/singularity/comments/1dejx97/today_i_learned_that_there_is_an_image_generation/l8cqxjd/?context=3)


purgebylight

Amazon... of course.


VancityGaming

Isn't SD3 getting roasted by the community? Go look at the subreddit. Why is it ranked so high on the leaderboard? Cherry-picked images?


Nyao

SD3 can actually produce good images, except for human anatomy where it produces abominations


7734128

The API version, which has been accessible for a while, is significantly better.


isffo

Besides the other stuff, the site doesn't let you choose your own prompts, and their prompts have hardly any humans or animals doing anything other than posing for a portrait, so it's softballing anatomy.


Shandilized

Was thinking the same. If anything, this has opened my eyes to just how relative these leaderboards are and how they are meant to be taken with a sizable grain of salt. It's been a loooooong long time since I was a Midjourney subscriber so I can't compare that model, but Dall-E 3 is still leaps and bounds better than SD3.

[A Rubik's cube watering can by SD3](https://i.imgur.com/N4aFb0o.jpeg)

[A Rubik's cube watering can by Dall-E 3](https://i.imgur.com/VKgrFFr.jpeg)

I'll probably be taking the leaderboards (whether it's for LLMs, image or speech) a lot less seriously henceforth. I did realize that they are not meant to be blindly and literally interpreted as fact, and that the ranking is not an absolute verdict on a model. But it's still eye-opening that the ranking can be *that* far off from reality; I did not at all expect the gap to be *that* huge. The SD3 watering can example is something that the very early versions of Midjourney, or Dall-E 2, would come up with.


radiopelican

honestly it's a bit weird seeing stable diffusion up there. In 2017 I used to work for Emad, the CEO (ex-CEO?) of stable diffusion. This year I've had people from Forbes and a few asset management firms reach out to me to speak. Honestly, I had no idea about stable diffusion before they spoke with me.