![]() |
Image: Google |
Google has announced the rollout of two new advanced capabilities for its Gemini AI chatbot. The features, which were first previewed at the Google I/O earlier this year, include the AI agent Gems and image generation capabilities of the recently released Imagen 3 AI model.
The AI agent Gems will be available to Gemini Advanced, Business, and Enterprise users, while the Imagen 3 features will be shipped to all users, including those on the free tier. However, users on the free version may see some added limits to image generation.
Gems are miniature versions of the chatbot with a limited dataset, allowing them to focus on specific topics and generate more specific and accurate information. Users can customize Gems to create a team of experts to help with challenging projects, brainstorm ideas, or write social media posts. Gems will be available in multiple languages on desktop and mobile devices in over 150 countries.
Imagen 3, Google's latest image generation AI tool, can generate images in different styles, such as Nikon DSLR, GoPro style, wide-angle lens, and more. It can also generate photorealistic landscapes, textured oil paintings, or whimsical claymation scenes. The AI model has been upgraded to include the generation of images of people, with added safeguards to reduce the risk of deepfakes. SynthID has been used to watermark the images as generated by AI.
The rollout of Imagen 3 capabilities may also include inline editing of generated images using text prompts. However, it appears that editing can only be done using text prompts. Google has specified that Imagen 3 will not support the generation of photorealistic, identifiable individuals, depictions of minors, or excessively gory, violent, or sexual scenes.
The integration of Gems and Imagen 3 into the Gemini apps is part of Google's efforts to enhance its AI capabilities and provide users with more advanced tools for image generation and chatbot interactions.
No comments:
Post a Comment