NVIDIA B200s are live on Lambda Cloud! Set up your Demo today! 
meta-logo

Llama 4 Scout 17B 16E Instruct

The Llama 4 Scout 17B 16E Instruct model is an image-text-to-text model designed to process and generate text based on image and text inputs. It is part of the Llama 4 series of models.

This model is available on Lambda Inference.

Price per 1M input tokens
$0.08
Price per 1M output tokens
$0.30
Context
1M

Introduction

The Llama 4 Scout 17B 16E Instruct model is a type of LLaMA4 model, which is a library of large language models developed for various natural language processing tasks. This particular model is designed for image-text-to-text tasks, leveraging the transformers library for efficient and effective processing. The model's architecture is based on a combination of image and text inputs to generate relevant text outputs. The Llama 4 Scout 17B 16E Instruct model is intended for use cases that require the processing of multimodal inputs, such as images and text, to generate informative and relevant text responses. The model was developed with a focus on providing accurate and informative outputs, and its design goals prioritize flexibility and adaptability in a range of applications.

* This content was generated using the llama-4-scout-17b-16e-instruct model via Lambda AI Inference