multimodal
-
Blog
OpenAI expands multi-modal capabilities with updated text-to-video model – Computerworld
“The true power of GenAI will be in realizing its multi-model capabilities,” said Sharath Srinivasamurthy, associate vice president at IDC. “Since OpenAI was lagging behind its competitors in text to video, this move was needed to stay relevant and compete.” However, both Google and Meta outpaced OpenAI in making their models publicly reviewable, even though Sora was first introduced in…
Read More » -
Blog
Mistral releases ‘Pixtral 12B,’ its first multimodal AI model – Computerworld
French AI startup Mistral has released its first multimodal model, the Pixtral 12B, which can handle both text and images, according to Techcrunch. The model uses 12 billion parameters and is based on Mistral’s Nemo 12B text model. Pixtral 12B can answer questions about images via URLs or images encoded with base64 such as how many copies of a certain object…
Read More »