Grok-1.5 Vision

🟨

1/1

April 12, 2024

Grok-1.5 Vision

Connecting digital & physical worlds with our first multimodal model.

Introducing Grok-1.5V, our first-generation multimodal model. In addition to its text capabilities, Grok can now process a wide variety of visual information, including documents, diagrams, charts, screenshots & photographs. Grok-1.5V will be available soon to our early testers and existing Grok users.

Capabilities

Grok-1.5V is competitive with existing frontier multimodal models in a number of domains, ranging from multi-disciplinary reasoning to understanding documents, science diagrams, charts, screenshots, and photographs. We are particularly excited about Grok’s capabilities in understanding our physical world. Grok outperforms its peers in our new RealWorldQA benchmark, which measures real-world spatial understanding.

Download: https://x.ai/news/grok-1.5v

Donate by purchasing this NFT, for developing new models (!) https://.aistore.ai 🟨

Token ID	2
Chain	Ethereum
Contract	0xb5f6···7e7f
Type	ERC721
Metadata	IPFS
Media	JPEG
