Visual ChatGPT: Combining Talking with Visual Elements

Microsoft researchers have unveiled Visual ChatGPT, an open-source extension that connects ChatGPT to a range of Visual Foundation Models so that images can be exchanged during a conversation.

Currently, ChatGPT cannot generate or manipulate images; it can only describe them, and those descriptions can then be used as prompts for tools like Stable Diffusion, DALL-E, or Midjourney. With the Visual ChatGPT project, the AI system gains the ability to generate images, edit them, remove objects, and perform other similar tasks.

Visual ChatGPT allows:

sending and receiving not only text but also images;
asking complex visual questions or giving visual editing instructions that require several AI models to work together over multiple steps;
giving feedback and asking for corrected results.
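
To make the second point concrete, here is a minimal Python sketch of how a multi-step request could be passed through several models in turn. It is an illustration only, not the project's actual code: the tool functions, the TOOLS registry, and run_plan are hypothetical names, and images are represented by plain strings rather than real files.

```python
from typing import Callable, Dict, List, Tuple

# Hypothetical stand-ins for visual foundation models. Each one takes an
# image handle (here just a string) plus an instruction and returns the
# handle of a new image. Real models would read and write image files.
def generate_image(_image: str, instruction: str) -> str:
    return f"generated({instruction}).png"

def remove_object(image: str, instruction: str) -> str:
    return f"removed[{instruction}]<-{image}"

# Hypothetical registry mapping a tool name to the function that runs it.
TOOLS: Dict[str, Callable[[str, str], str]] = {
    "generate": generate_image,
    "remove": remove_object,
}

def run_plan(plan: List[Tuple[str, str]], image: str = "") -> List[str]:
    """Run each (tool, instruction) step on the previous step's output."""
    history: List[str] = []
    for tool_name, instruction in plan:
        image = TOOLS[tool_name](image, instruction)
        history.append(image)  # keep every intermediate result
    return history

# "Draw a cat on a sofa, then remove the sofa" expressed as two steps.
plan = [("generate", "a cat on a sofa"), ("remove", "sofa")]
results = run_plan(plan)
for step, result in enumerate(results, start=1):
    print(step, result)
```

Keeping every intermediate result in the history is what would let a user point back at an earlier image and ask for a correction, as in the third point of the list.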

Find the project on GitHub

Read the paper: "Visual ChatGPT: Talking, Drawing and Editing with Visual Foundation Models"

Learn more about GPT: https://twitter.com/BeingOvee/status/1634211136836079618

Find me on: Twitter | Linkedin | Instagram | Facebook | Pinterest
