Overview

Models

Compare Diffio models and choose the right balance of speed and clarity.

Diffio offers three speech restoration models, each tuned for a different balance of speed, cost, and clarity. Choose a model for every generation, and run multiple generations on the same project to compare results.

Model lineup

Use the model id in the SDK model field or call the matching REST endpoint. Each model focuses on speech, with different performance and fidelity targets.

Diffio 2.0

diffio-2

Balanced

Fast, reliable cleanup for everyday restoration.

Best for: General restoration, quick turnaround, high volume work.

  • Fastest turnaround in the product lineup.
  • Reliable cleanup for interviews and podcasts.
  • Best fit for high volume production queues.

Diffio 3.5

diffio-3.5

Best

Best quality reconstruction for demanding speech.

Best for: Critical clarity work and detailed transcripts.

  • Improved clarity in noisy rooms.
  • Sharper consonants and tighter detail.
  • Stronger alignment for transcript review.

Choosing a model

Start with a fast preview, then move to higher fidelity for final delivery. Keep the same project and create a new generation each time you change models.

  • diffio-2 for rapid previews or high volume queues.
  • diffio-2 for balanced speed and clarity when throughput matters.
  • diffio-3.5 for the highest clarity on noisy or distant speech.
Plan access

Diffio 2.0 and Diffio 3.5 require a paid plan. Review billing details for plan options.

Endpoint mapping

Model ids map to specific generation endpoints. Use these paths when calling the REST API directly.

Model idLabelEndpoint
diffio-2Diffio 2.0POST /v1/diffio-2.0-generation
diffio-3.5Diffio 3.5POST /v1/diffio-3.5-generation

Next steps