Dataset Viewer
The dataset could not be loaded because the splits use different data file formats, which is not supported. Read more about the splits configuration. Click for more details.
Couldn't infer the same data file format for all splits. Got {NamedSplit('train'): ('json', {}), NamedSplit('validation'): (None, {}), NamedSplit('test'): ('json', {})}
Error code:   FileFormatMismatchBetweenSplitsError

Need help to make the dataset viewer work? Make sure to review how to configure the dataset viewer, and open a discussion for direct support.

Satellite Disruption Triage Aux v2.1

Self-contained real-image repair of ChrisRPL/satellite-disruption-triage-aux-v2.

This version keeps only resolvable BRIGHT real-image rows in VLM SFT files. Synthetic reasoning rows are separated, SEN12MSCR is excluded because the license is unknown, and xBD-Ukraine rows from v2 are excluded because their image references are not resolvable in the stated source repo.

Files

  • train_flat.jsonl / train_sft.jsonl: real-image train rows
  • eval_flat.jsonl / eval_sft.jsonl: event-held-out real-image eval rows
  • eval_calibration_flat.jsonl / eval_calibration_sft.jsonl: reduced real-image calibration rows
  • synthetic_reasoning_flat.jsonl / synthetic_reasoning_sft.jsonl: metadata-only rows, not for VLM SFT
  • excluded_unknown_license.jsonl: SEN12MSCR rows
  • excluded_unresolvable_references.jsonl: xBD-Ukraine rows with bad refs

Counts

  • train_flat.jsonl: 2665
  • eval_flat.jsonl: 381
  • eval_calibration_flat.jsonl: 68

Validation

  • overall_pass: True
  • image_count: 6228
  • action_balance: {"eval_calibration_flat.jsonl": {"defer": 26, "discard": 16, "downlink_now": 26}, "eval_flat.jsonl": {"defer": 124, "discard": 110, "downlink_now": 147}, "train_flat.jsonl": {"defer": 811, "discard": 916, "downlink_now": 938}}

Limitations

  • BRIGHT uses optical baseline to SAR current imagery, so modality artifacts remain a risk.
  • License is CC-BY-NC-4.0 inherited from BRIGHT; non-commercial use only.
  • Bboxes are still source-derived/synthetic approximations, not manual polygons.
  • Calibration split is no longer balanced after filtering to real-image, license-safe rows.
  • Intended for civilian disruption triage only; not for tactical targeting, strike support, or military asset ranking.
Downloads last month
73