Phi-4-Multimodal Playground

This demo allows you to interact with the Phi-4-Multimodal AI model. You can type messages, upload images, or record audio to communicate with the AI. Other demos include Phi-4-Mini playground, Thoughts Organizer, Stories Come Alive, Phine Speech Translator

MultimodalTextbox
Powered by Microsoft Phi-4-multimodal model on Azure AI.©2025

Instructions

  • Type a question or statement
  • Upload images or audio files
  • You can combine text with media files
  • Support 2 modalities at the same time
  • The model can analyze images and transcribe audio
  • For best results with images, use JPG or PNG files
  • For audio, use WAV, MP3, or FLAC files

Capabilities

This chatbot can:

  • Answer questions and provide explanations
  • Describe and analyze images
  • Transcribe, translate, summarize, and analyze audio content
  • Process multiple inputs in the same message
  • Maintain context throughout the conversation