In the ongoing effort to make AI more like humans, OpenAI's GPT models have repeatedly pushed the boundaries. GPT-4 can now accept prompts containing both text and images. Multimodality in generative AI...
GPT-4 with vision (GPT-4V) enables users to instruct GPT-4 to analyze image inputs provided by the user, and is the most recent capability we're making broadly available. Incorporating additional modalities (such as image inputs)...