Dataset Viewer
The dataset could not be loaded because the splits use different data file formats, which is not supported. Read more about the splits configuration. Click for more details.
Couldn't infer the same data file format for all splits. Got {NamedSplit('train'): ('json', {}), NamedSplit('validation'): (None, {}), NamedSplit('test'): ('json', {})}
Error code: FileFormatMismatchBetweenSplitsError
Need help to make the dataset viewer work? Make sure to review how to configure the dataset viewer, and open a discussion for direct support.
Satellite Disruption Triage Aux v2.1
Self-contained real-image repair of ChrisRPL/satellite-disruption-triage-aux-v2.
This version keeps only resolvable BRIGHT real-image rows in VLM SFT files. Synthetic reasoning rows are separated, SEN12MSCR is excluded because the license is unknown, and xBD-Ukraine rows from v2 are excluded because their image references are not resolvable in the stated source repo.
Files
train_flat.jsonl/train_sft.jsonl: real-image train rowseval_flat.jsonl/eval_sft.jsonl: event-held-out real-image eval rowseval_calibration_flat.jsonl/eval_calibration_sft.jsonl: reduced real-image calibration rowssynthetic_reasoning_flat.jsonl/synthetic_reasoning_sft.jsonl: metadata-only rows, not for VLM SFTexcluded_unknown_license.jsonl: SEN12MSCR rowsexcluded_unresolvable_references.jsonl: xBD-Ukraine rows with bad refs
Counts
train_flat.jsonl:2665eval_flat.jsonl:381eval_calibration_flat.jsonl:68
Validation
- overall_pass:
True - image_count:
6228 - action_balance:
{"eval_calibration_flat.jsonl": {"defer": 26, "discard": 16, "downlink_now": 26}, "eval_flat.jsonl": {"defer": 124, "discard": 110, "downlink_now": 147}, "train_flat.jsonl": {"defer": 811, "discard": 916, "downlink_now": 938}}
Limitations
- BRIGHT uses optical baseline to SAR current imagery, so modality artifacts remain a risk.
- License is CC-BY-NC-4.0 inherited from BRIGHT; non-commercial use only.
- Bboxes are still source-derived/synthetic approximations, not manual polygons.
- Calibration split is no longer balanced after filtering to real-image, license-safe rows.
- Intended for civilian disruption triage only; not for tactical targeting, strike support, or military asset ranking.
- Downloads last month
- 73