stringforge.vulcan.VulcanMLView

Contents

stringforge.vulcan.VulcanMLView#

class stringforge.vulcan.VulcanMLView(reader, *, fractions=(0.8, 0.1, 0.1), splits=('train', 'val', 'test'), salt='', feature_spec=None)#

Bases: object

Train/val/test view of a Vulcan source.

Wraps a stringforge.vulcan.VulcanReader with deterministic, geometry-disjoint splits and a configurable FeatureSpec. Two consumption paths:

  • as_dataframe() – materialise a pandas DataFrame for light workflows.

  • as_hf_dataset() – materialise a streaming datasets.Dataset (requires the datasets package).

Parameters:
  • reader (VulcanReader) – Source reader.

  • fractions (Sequence[float]) – Per-split fractions (must sum to 1.0).

  • splits (Sequence[str]) – Split labels.

  • salt (str) – Optional salt for cross-validation folds.

  • feature_spec (Optional[FeatureSpec]) – Default FeatureSpec; can be overridden per call.

See the module page for the curated public method index. The class page is kept as the canonical object target for cross-references and the general index.