Supported Models (Overview)¶

This page is the model selection entry point. Use it to answer one question quickly: which model IDs should I shortlist for this task?

Once you have a shortlist, use Advanced Model Reference for side-by-side preprocessing and temporal details, then open the linked detail page for the exact contract, caveats, and examples. If you are about to change input_prep, variant, fetch resolution, patch size, or image size, read Before You Start first, because those knobs affect both runtime cost and embedding semantics.

How To Read This Page¶

Start with the quick chooser, then scan the catalog table for input and temporal fit, and open the detail page before benchmarking or production use.

Canonical model IDs use the short public names shown on this page, such as remoteclip, prithvi, terrafm, and thor. Some detail-page filenames still use older names for compatibility, but the canonical IDs above are the names users should copy into code.

Quick Chooser by Goal¶

Goal	Good starting models	Why
Fast baseline / simple pipeline	`tessera`, `gse`, `copernicus`	Precomputed embeddings, fewer runtime dependencies
Simple S2 RGB on-the-fly experiments	`remoteclip`, `satmae`, `satmaepp`, `scalemae`	Straightforward RGB input paths
Time-series temporal modeling	`agrifm`, `anysat`, `galileo`	Native multi-frame temporal packaging
Multispectral / strict spectral semantics	`satmaepp_s2_10b`, `dofa`, `terramind`, `thor`, `satvision`	Strong channel/schema assumptions
Mixed-modality experiments (S1/S2)	`terrafm`, `thor`	Supports S2 or S1 path (per call)

Model Catalog Snapshot¶

Precomputed Embeddings¶

Model ID	Type	Primary Input / Source	Default Resolution	Dim	Temporal mode	Notes	Detail
`tessera`	Precomputed	GeoTessera embedding tiles	10m	128	yearly coverage product	Fast baseline, source-fixed precomputed workflow; product-native fixed CRS	detail
`gse`	Precomputed	Google Satellite Embedding (annual)	10m	64	`TemporalSpec.year(...)`	Annual product via provider path	detail
`copernicus`	Precomputed	Copernicus embeddings	0.25°	768	limited (2021)	Coarse resolution product on fixed EPSG:4326 grid	detail

On-the-fly Foundation Models¶

Model ID	Primary Input	Dim	Default Resolution	Temporal style	Notable requirements	Detail
`remoteclip`	S2 RGB (`B4,B3,B2`)	512	10m	single composite	CLIP projection; RGB preprocessing	detail
`satmae`	S2 RGB (`B4,B3,B2`)	1024	10m	single composite	ViT-L; MAE token/grid	detail
`satmaepp`	S2 RGB (`B4,B3,B2`)	1024	10m	single composite	ViT-L; fMoW eval preprocessing	detail
`satmaepp_s2_10b`	S2 10-band	1024	10m	single composite	strict band order; grouped-channel tokens	detail
`scalemae`	S2 RGB + scale	1024	10m	single composite	`sensor.scale_m` is a model input	detail
`wildsat`	S2 RGB	256	10m	single composite	biodiversity training; image_head default	detail
`prithvi`	S2 6-band	768	30m	single composite	required temporal + location side inputs	detail
`terrafm`	S2 12-band or S1 VV/VH	768	10m	single composite	dual-modality by channel count	detail
`terramind`	S2 12-band	384	10m	single composite	ViT-S class; strict z-score normalization	detail
`dofa`	Multispectral + wavelengths	768	10m	single composite	wavelength vector required	detail
`fomo`	S2 12-band	768	10m	single composite	per-channel spectral modality keys	detail
`thor`	S2 10-band or S1 VV/VH	768	10m	single composite	dual-modality; grouped tokens; native-snap	detail
`satvision`	TOA 14-channel (MODIS)	4096	1000m	single composite	SwinV2 Giant; strict channel calibration	detail
`anysat`	S2 10-band time series	768	10m	multi-frame	JEPA; `s2_dates` DOY side input	detail
`galileo`	S2 10-band time series	128	10m	multi-frame	nano default; month tokens	detail
`agrifm`	S2 10-band time series	1024	10m	multi-frame	Video Swin; fixed `T` frame stack	detail

Temporal and Comparison Notes (What People Usually Miss)¶

TemporalSpec.range(start, end) is usually a compositing window rather than a single-scene selector, and OutputSpec.grid() may be a token or patch grid rather than a georeferenced raster, especially for ViT-like backbones. Cross-model comparisons are usually easiest with OutputSpec.pooled() plus fixed ROI, temporal, and compositing settings.

Precomputed products can also keep their own product-native projection instead of the common provider-backed EPSG:3857 sampling grid. Today that matters especially for tessera and copernicus, so check each detail page before comparing grid outputs directly against on-the-fly models.

On this page, "Default Resolution" means the default source-side fetch resolution, not the final resized tensor shape sent into the backbone. Multi-frame models such as agrifm, anysat, and galileo also need extra attention to frame count and temporal side inputs.

Read the details in Supported Models (Advanced Reference).

More Detail¶

For cross-model preprocessing, temporal packaging, and environment knobs, continue to Advanced Model Reference. For user-facing guidance on how to trade compute for quality, spatial detail, or temporal fidelity, read Before You Start. If you are adding a new adapter, use Extending to keep the implementation and documentation consistent.