MultiModel

Grok-1.5 Vision

APRIL 12, 2024 Grok-1.5 Vision Connecting digital & physical worlds with our first multimodal model. Introducing Grok-1.5V, our first-generation multimodal model. In addition to its text capabilities, Grok can now process a wide variety of visual information, including documents, diagrams, charts, screenshots & photographs. Grok-1.5V will be available soon to our early testers and existing Grok users. Capabilities Grok-1.5V is competitive with existing frontier multimodal models in a number of domains, ranging from multi-disciplinary reasoning to understanding documents, science diagrams, charts, screenshots, and photographs. We are particularly excited about Grok’s capabilities in understanding our physical world. Grok outperforms its peers in our new RealWorldQA benchmark that measures real-world spatial understanding. Download; https://x.ai/news/grok-1.5v Donate by Putchasing this NFT. For Developing New Models (!) https://.aistore.ai 🟨




Token ID2
Chain
Ethereum
Contract
Type
ERC721
MetadataIPFS
MediaJPEG