New small language model from Microsoft that can look at images and tell their content
Microsoft announced the new version of its small language model called Phi-3, which can look at images and tell what they contain. Being a multi-mode model, Phi-3-vision can read both text and images and is best used on mobile devices. Microsoft says Phi-3-vision, available now …