AWS Lambda Introduces MicroVMs for Isolated Sandboxed Execution

Today, AWS announced Lambda MicroVMs, a new serverless compute primitive within AWS Lambda that lets you run code generated by users or AI in isolated, stateful execution environments. You get virtual machine level isolation, near-instant launch and resume, and direct control over environment lifecycle and state — all without managing infrastructure or building expertise in complex virtualization technologies. Lambda MicroVMs are powered by Firecracker, the same lightweight virtualization technology that has powered over 15 trillion monthly Lambda function invocations.

Why Customers Need This

Over the past few years, a new class of multi-tenant applications has emerged that all share the need to hand each end user their own dedicated execution environment in which to safely run code that the application developer did not write. AI coding assistants, interactive code environments, data analytics platforms, vulnerability scanners, and game servers that run user-supplied scripts all fit this pattern.

Building that capability today means making a difficult choice. Virtual machines deliver strong isolation but take minutes to start. Containers launch in seconds, yet their shared-kernel architecture requires significant custom hardening to safely contain untrusted code. Functions as a service are optimized for event-driven, request-response workloads, but are not designed for long-running interactive sessions that need to retain environment state across user interactions. That leaves developers either accepting tradeoffs between performance and isolation, or investing significant engineering resources to build and operate custom virtualization infrastructure to achieve isolated execution while delivering low-latency experiences to end users — an effort that demands deep expertise and pulls engineering time away from the product they are actually trying to build.

Lambda MicroVMs is purpose-built for exactly this gap. Each MicroVM gives a single end user or session its own isolated environment that launches rapidly, retains memory and disk state for the length of the session, and pauses to a low idle cost when the user steps away. Because the same Firecracker technology already underpins AWS Lambda Functions, you inherit the operational maturity of a service that has been running this stack at scale.

Getting Started

To get started, navigate to the AWS Lambda console, where Lambda MicroVMs now appears in the left-hand navigation menu. The first step is to create a MicroVM Image.

Package a Flask web app and its Dockerfile into a zip file and upload it to an Amazon S3 bucket.

Flask API — app.py

import logging

from flask import Flask, jsonify

app = Flask(__name__)
logging.basicConfig(level=logging.INFO)


@app.route("/")
def hello():
    app.logger.info("Received request to hello world endpoint")
    return jsonify(message="Hello, World!")


if __name__ == "__main__":
    app.run(host="0.0.0.0", port=5000)

Dockerfile

FROM public.ecr.aws/lambda/microvms:al2023-minimal
RUN dnf install -y python3 python3-pip && dnf clean all

WORKDIR /app

COPY requirements.txt .
RUN pip install --no-cache-dir -r requirements.txt

COPY app.py .

EXPOSE 5000

CMD ["gunicorn", "--bind", "0.0.0.0:5000", "app:app"]

Use the following command to create the MicroVM Image:

aws lambda-microvms create-microvm-image \
--code-artifact uri=<path/to/s3/artifact.zip> --name <VM_image_name> \
--base-image-arn arn:aws:lambda:us-east-1:aws:microvm-image:al2023-1 \
--build-role-arn <IAM role ARN>

MicroVM Image creation in the AWS Console

You can also create the MicroVM Image in the AWS Console as shown above. Once the command runs, Lambda retrieves the zip, runs the Dockerfile, initializes the application, and takes a Firecracker snapshot of the running disk and memory state. Build logs stream in real time to Amazon CloudWatch under /aws/lambda/microvms/<image-name>, and when the image is ready it appears in the console with its Amazon Resource Name (ARN) and version number.

To launch a MicroVM:

aws lambda-microvms run-microvm \
--image-identifier arn:aws:lambda:<region>:<acct>:microvm-image:my-image \
--execution-role-arn arn:aws:iam::<acct>:role/MicroVMExecutionRole \
--idle-policy '{"maxIdleDurationSeconds":900,"suspendedDurationSeconds":300,"autoResumeEnabled":true}'

Launching can also be done via the AWS Console or the CLI. Pass the image ARN and an idle policy configured to auto-suspend after 15 minutes of inactivity and auto-resume on the next incoming request. No networking setup is required. Lambda assigns the MicroVM a unique ID, returns a dedicated endpoint URL, and starts a new MicroVM with the Flask app already running, since it is resumed from a snapshot. One API call delivers a fully initialized, bootstrapped compute environment.

MicroVM running in the AWS Console

To send traffic, generate a short-lived auth token with the CLI and attach it to a plain HTTPS request using the X-aws-proxy-auth header. The request lands on the Flask app immediately. After the MicroVM sits idle past the suspend threshold, it is suspended with its memory and disk state snapshotted and stored. On the next request, it resumes with the application state fully intact. From the client side, the pause is transparent.

MicroVM suspend and resume behavior

How It Works

Lambda MicroVMs delivers three capabilities that, until now, no single AWS compute service offered together.

Virtual machine level isolation comes from Firecracker. Each session runs in its own dedicated MicroVM with no shared kernel and no shared resources between users, so untrusted code supplied by one user is contained to their execution environment, without access to other environments or the underlying system.

Rapid launch and resume is based on an image-then-launch model. You create a MicroVM Image by supplying a Dockerfile and code packaged as a zip artifact in Amazon S3. Lambda runs the Dockerfile, initializes the application, and takes a Firecracker snapshot of the running environment’s memory and disk state. Every subsequent MicroVM launched from that image resumes from the pre-initialized snapshot rather than booting cold, which means launches and idle resumes both achieve near-instant startup latency. Even a multi-gigabyte interactive session comes back online quickly enough to feel responsive to the end user.

Stateful execution means a running MicroVM retains memory, disk, and running processes across the user’s session. During idle periods, a MicroVM can be suspended — with memory and disk state intact — and resumed when traffic arrives. Installed packages, loaded models, and working filesets are readily available when the user resumes their session. MicroVMs support up to 8 hours of total runtime and can be suspended automatically after a configurable idle window, making it straightforward to build products as varied as software vulnerability scans that complete in minutes, data analytics applications that run for hours, and interactive coding sessions with extended idle periods. Note that because Lambda MicroVMs are started from pre-initialized snapshots, applications generating unique content, establishing network connections, or loading ephemeral data during initialization may need to integrate with service-provided hooks for compatibility.

Lambda MicroVMs is a distinct resource within AWS Lambda with its own API surface. Lambda Functions remain the right choice for event-driven, request-response workloads, while Lambda MicroVMs is purpose-built for multi-tenant applications that need to hand each end user or session their own isolated environment to execute user- or AI-generated code. The two complement each other: an application using Lambda Functions for its event-driven backbone can call into Lambda MicroVMs for steps that need to run untrusted code in isolation.

Availability

AWS Lambda MicroVMs is available today in the US East (N. Virginia, Ohio), US West (Oregon), Europe (Ireland), and Asia Pacific (Tokyo) Regions, on the ARM64 architecture, with up to 16 vCPUs, 32 GB of memory, and 32 GB of disk per MicroVM. Idle MicroVMs can be suspended explicitly through an API call or automatically through a lifecycle policy, reducing running cost while preserving full state for fast resume. Pricing details are available on the AWS Lambda pricing page.

To get started, visit the AWS Lambda console, explore the Lambda MicroVMs product page, or refer to the Lambda MicroVMs Developer Guide for full documentation.

Why Customers Need This

Getting Started

How It Works

Availability

Related Articles

AI Coding Speed Outpaces Governance, GitLab Survey Finds

Anthropic Launches Claude Tag as a Persistent Slack Team Member

Building Durable AI Agents: A Study Guide for Kubernetes and Temporal