Join us on November 9th to learn how to successfully innovate and achieve efficiency by upskilling and scaling citizen developers at the Low-Code/No-Code Summit. Register here.
Get ready, developers: Today, OpenAI released the highly anticipated. DALL-E API in public betaWhich means developers can now integrate DALL-E directly into their apps and products.
With the announcement, DALL-E, a transformer language model that allows users to use natural language notation to create and edit real images, will be added to OpenAI’s API platform for GPT-3, Embeddings and Codex. joins the
Companies like Kala, a fashion design platform, and Mixtiles, which prints images online on lightweight decorative tiles, have already implemented and tested the API for their specific use cases.
Meanwhile, Microsoft is bringing DALL-E to its new graphic design app, Designer — and integrating DALL-E with Image Creator in Bing and Microsoft Edge, allowing users to create images if the web The results do not return what they are looking for. Stock imagery provider Shutterstock also announced last week that it will use the API to offer DALL-E-generated images to users.
Low Code/No Code Summit
Learn how bBuild, scale, and control low-code programs in a straightforward way that makes November 9 a success for all.Register today for your free pass.
OpenAI will continue to iterate on the DALL-E API.
The API will be available for anyone to use on the OpenAI platform, OpenAI product manager Luke Miller told VentureBeat.
With the API in beta, “we’ll continue to iterate and improve through the end of the year,” he said. “We’re really excited about all the ways that developers can take this technology and customize it for specific needs, specific applications and specific communities, so we can scale more than ever before. ”
Miller added that the company has taken many of the lessons learned over the past months of rolling out the DALL-E beta to millions of users and built it into an API, “so that we can feel comfortable sharing it with the world, but Developers focus on the fun aspects of building.
DALL-E’s fast-paced journey to cultural touchstones
The DALL-E API is another big move for the text-to-image generator, which has become part of the mainstream pop culture zeitgeist since the release of DALL-E 2 just six months ago – including the creations by DALL Millions of art ideas. -E on social media, a segment about DALL-E today’s show, And a Recent appearance By CTO Meera Murthy The Daily Show.
At the same time, there has been much noise and intense debate over issues including the possibility of legal wrangling over ownership of the copyright of DALL-E images. How can DALL-E. Reflection of bias In its training data; And Questions Regarding the validity and reliability of the DALL-E.
But OpenAI claims that 3 million people are already using DALL-E to boost creativity and speed up workflow, generating more than 4 million images a day. Developers can now start building in minutes with DALL-E, he says.
From side projects to startups
This includes making it easy to get up and running by signing up, getting an API key and starting to build, Miller explained.
“Whether it’s someone just hacking away at a fun side project on the weekend, whether it’s an early-stage startup, an artist working on a creative project, or a large enterprise, all those people are capable of are able to come in and integrate this technology into their product,” he said, echoing what many are predicting — that the DALL-E API will be first launched by creative AI startups. will open the floodgates.
“The fun hacking side project will eventually become a startup in some respects,” he said. “Ultimately, if you’re excited to build with this technology, we want you to be able to do it and incorporate it into your products.”
Rowan Curran, an AI and ML analyst at Forrester Research, believes that if the DALL-E API allows for image editing and optimization, it will be “hugely useful” for developers.
“You can then add it as a complete application to any enterprise use case,” he told VentureBeat.
API will be priced per image.
The DALL-E API is priced per image output based on size. 1024 x 1024 costs $0.02/Image, while there are very slight discounts for 512 x 512 at $0.018/Image and 256 x 256 at $0.016/Image.
Miller explained that the API has three capabilities. Users can create an image, edit a part of the image, and create multiple variations of the image.
“You can think of it as the opposite of the creative process, coming up with ideas, picking something and cutting it down and then iterating and finding something that works for you,” he said. Be appropriate to the need and given context.”
Historically, Curran pointed out that one of the limiting factors around large language models overall has been the cost involved in running them. So if the DALL-E API is priced right, he said it would “open up a whole set of use cases, especially for startups and people who are getting seed funding.”
That said, he added, large enterprises, especially innovation teams, will likely want to use the DALL-E API as well.
“Also, I expect to see more research and usage at the enterprise level moving forward in terms of adapting and fine-tuning their own big language models for different use cases,” he said. “Because I think the ability to take large language models, add that fine-tuning layer on top for some of these specific industries is where it’s going to be really game-changing.”
Questions about trust and security
Critics continue to question the trust and safety issues of generative AI in general and DALL-E in particular – that fake images can be used to bully and harass, for example, or misinform. To spread and incite violence. In May, The researchers said that the tool It can also reinforce stereotypes against women and people of color.
Those with ethical and legal questions surrounding DALL-E won’t be happy with the news that images generated with the API won’t need watermarks—which were implemented during DALL-E 2 beta but Optional with API.
But in a press release, OpenAI maintained that the DALL-E API is “incorporating the trust and security lessons we’ve learned while deploying DALL-E to 3 million artists and users worldwide. ”
With the API, “developers can ship with confidence knowing that built-in mitigations — such as filters for hate symbols and gore — will handle the challenging aspects of moderation,” the press release continued. “As part of OpenAI’s commitment to responsible deployment, we will continue to make trust and security a top priority so developers can focus on building.”
Mixtiles uses the DALL-E API to create memories.
Eytan Levit, co-founder of Tel Aviv-based Mixtiles, says the company immediately saw the potential of DALL-E 2 and signed up for early access.
“We started playing with the DALL-E 2 to create framed pictures of childhood memories, ‘spirit animals’ and dreams that our family members and friends described to us,” he told VentureBeat. “We wanted to see if they would hang those pictures on their walls, and they did.”
Levitt points out that there is a learning curve for a first-time DALL-E user. “For example, you need to know what styles you can use, such as oil painting, digital art, pencil sketch or watercolor,” he said. “We’ve learned that referring to the time of day materially affects your results, while color palettes also help to get the best images.”
Using an API, Mixtiles’ approach has been to guide the user through a series of steps, each step bringing them closer to creating artwork that resonates with them emotionally.
“We believe simplicity is the key to unlocking this amazing technology to the millions of people who can use it to decorate their homes,” Levitt said.
Ultimately, he added, Mixtiles is betting that generative AI and DALL-E represent a new technological leap, “equal to the invention of paper, the picture frame, the canvas print or computer graphics — we think I think it will spark an explosion. of new use cases, of human creativity and emotional connection.”
For Mixtiles, that means allowing users to upload family photos and portraits and then customize those photos.
“Imagine turning your child’s photo into their favorite superhero, or turning your family portrait into a Simpsons-style character, or a Van Gogh-style painting,” Levitt said. “We are optimistic that generative AI will become an integral part of our value proposition in the near future.”
Mission of VentureBeat To be a digital town square for technology decision makers to learn about transformative enterprise technology and transactions. Explore our briefing.
#DALLE #API #released #public #beta #OpenAI #potential #benefit #app #developers