stupid_man_costume 1 month ago

they need to specify stable diffusion 3 large vs medium models. the one the released is.. ya

pxp121kr 1 month ago

they do, when you select the right or left image, on the top the model name shows up. I encountered a few times Stable Diffusion 3 Medium and sometimes Stable Diffusion 3 (maybe this is the large?) i think they haven't collected enough results yet to make the Stable Diffusion 3 medium show up on the leaderboard

stupid_man_costume 1 month ago

i meant on the leaderboard

Charuru 1 month ago

For stable diffusion they need more finetunes… the reason why people prefer it sometimes is because the finetunes are so strong

Fastizio 1 month ago

There is a text to speech leaderboard as well from LMSYS, Elevenlabs is destroying the competition as expected.

AdAnnual5736 1 month ago

Today I learned there’s such a thing as Dall e 3 HD. How does someone access that?

Shandilized 1 month ago

Only through the API, the app does not offer it. It's twice as taxing on the servers so they won't include that in the ChatGPT Plus subscription. You can specify the quality when calling the API. Just change the quality to hd instead of standard, like so; from openai import OpenAI client = OpenAI() response = client.images.generate( model="dall-e-3", prompt="a white siamese cat", size="1024x1024", quality="hd", n=1, ) image_url = response.data[0].url Be aware that the price is doubled. 8 cents an image instead of 4 cents an image. It quickly adds up.

AdAnnual5736 1 month ago

Thanks!

Shandilized 1 month ago

You're welcome! :)

bearbarebere 1 month ago

!remindme 2 days

RemindMeBot 1 month ago

I will be messaging you in 2 days on [**2024-06-15 00:26:04 UTC**](http://www.wolframalpha.com/input/?i=2024-06-15%2000:26:04%20UTC%20To%20Local%20Time) to remind you of [**this link**](https://www.reddit.com/r/singularity/comments/1dejx97/today_i_learned_that_there_is_an_image_generation/l8cqxjd/?context=3) [**3 OTHERS CLICKED THIS LINK**](https://www.reddit.com/message/compose/?to=RemindMeBot&subject=Reminder&message=%5Bhttps%3A%2F%2Fwww.reddit.com%2Fr%2Fsingularity%2Fcomments%2F1dejx97%2Ftoday_i_learned_that_there_is_an_image_generation%2Fl8cqxjd%2F%5D%0A%0ARemindMe%21%202024-06-15%2000%3A26%3A04%20UTC) to send a PM to also be reminded and to reduce spam. ^(Parent commenter can ) [^(delete this message to hide from others.)](https://www.reddit.com/message/compose/?to=RemindMeBot&subject=Delete%20Comment&message=Delete%21%201dejx97) ***** |[^(Info)](https://www.reddit.com/r/RemindMeBot/comments/e1bko7/remindmebot_info_v21/)|[^(Custom)](https://www.reddit.com/message/compose/?to=RemindMeBot&subject=Reminder&message=%5BLink%20or%20message%20inside%20square%20brackets%5D%0A%0ARemindMe%21%20Time%20period%20here)|[^(Your Reminders)](https://www.reddit.com/message/compose/?to=RemindMeBot&subject=List%20Of%20Reminders&message=MyReminders%21)|[^(Feedback)](https://www.reddit.com/message/compose/?to=Watchful1&subject=RemindMeBot%20Feedback)| |-|-|-|-|

purgebylight 1 month ago

Amazon... of course.

VancityGaming 1 month ago

Isn't SD3 getting roasted by the community? Go look at the subreddit. Why is it ranked so high on the leaderboard? Cherry picked images?

Nyao 1 month ago

SD3 can actually produce good images, except for human anatomy where it produces abominations

7734128 1 month ago

The API version, which has been accessible for a while, is significantly better.

isffo 1 month ago

Besides the other stuff the site doesn't have you choose your own prompts, and their prompts have hardly any humans or animals doing stuff other than being in portrait, so it's softballing anatomy.

Shandilized 1 month ago

Was thinking the same. If anything, this has opened my eyes to just how relative these leaderboards are and how they are meant to be taken with a sizable grain of salt. It's been a loooooong long time since I've ever been a Midjourney subscriber so I can't compare that model, but Dall-E 3 is still leaps and bounds better than SD3. [A Rubik's cube watering can by SD3](https://i.imgur.com/N4aFb0o.jpeg) [A Rubik's cube watering can by Dall-E 3](https://i.imgur.com/VKgrFFr.jpeg) I'll probably be taking the leaderboards (whether it's for LLM's, image or speech) a lot less serious henceforth. I did realize that they are not meant to be blindly and literally interpreted as a fact, and that the ranking is not an absolute verdict of a model. But it's still eye-opening that the ranking can be *that* far off of the reality; I did not at all expect the gap to be *that* huge. The SD3 watering can example is something that the very early versions of Midjourney, or Dall-E2, would come up with.

radiopelican 1 month ago

honestly it's a bit weird seeing stable diffusion up there. in 2017 I used to work for Emad the CEO (EX-ceo?) of stable diffusion. This year I've had people from forbes, and a few asset management firms reach out to me to speak I had no idea about stable diffusion before they spoke with me honestly.

Comments

Leave Your Comment

Hi Its Me!

Comments

Leave Your Comment

Hi Its Me!

Subscribe