pfd.op package
Submodules
pfd.op.collect module
- class pfd.op.collect.CollectData(*args, **kwargs)[source]
Bases:
OP
Collect and process molecular systems data for machine learning workflows.
This operation aggregates multiple atomic systems, applies optional sampling, and converts them to dpdata.MultiSystems format for downstream ML training. Supports both labeled and unlabeled data with optional train/test splitting.
Examples
>>> collector = CollectData()
>>> result = collector.execute({
...     "systems": [Path("system1"), Path("system2")],
...     "type_map": ["H", "O"],
...     "optional_parameters": {"test_size": 0.2}
... })
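The `test_size` option in the example above suggests a fractional hold-out split. The following is a minimal sketch of such a split in plain Python, not the code used by `CollectData`; the helper name `split_train_test`, the `seed` argument, and the rounding rule are assumptions made for illustration.

```python
import random
from typing import List, Sequence, Tuple, TypeVar

T = TypeVar("T")


def split_train_test(
    items: Sequence[T], test_size: float, seed: int = 0
) -> Tuple[List[T], List[T]]:
    """Hypothetical helper: shuffle items and hold out a fraction for testing."""
    if not 0.0 <= test_size < 1.0:
        raise ValueError("test_size must be in [0, 1)")
    shuffled = list(items)
    # A seeded generator keeps the split reproducible between runs.
    random.Random(seed).shuffle(shuffled)
    n_test = int(round(len(shuffled) * test_size))
    # The first n_test shuffled items form the test set, the rest the training set.
    return shuffled[n_test:], shuffled[:n_test]


# Placeholder frame labels stand in for real configurations.
frames = [f"frame_{i}" for i in range(10)]
train, test = split_train_test(frames, test_size=0.2)
print(len(train), len(test))
```

Shuffling before slicing avoids holding out only the tail of a trajectory, where frames tend to be correlated in time.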
- execute(ip: OPIO) → OPIO[source]
Execute the OP.
- Parameters:
ip (dict) –
Input dict with components:
structures: (Artifact(List[Path])) Configurations collected in this iteration.
pre_structures: (Artifact(Path), optional) A single extxyz file containing the configurations collected in previous iterations at the current stage.
- Returns:
op –
Output dict with components:
task_names: (List[str]) The names of the tasks, used as task identities. Each task name is unique.
task_paths: (Artifact(List[Path])) The prepared working paths of the tasks. Each path contains all input files needed to start the LAMMPS simulation. The order of the paths should be consistent with op["task_names"].
- Return type:
dict
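The return description requires task names to be unique and the paths to follow the same order as the names. A small sketch of how a caller might check that contract is given below; the helper `pair_tasks` and the `task.000000` naming pattern are hypothetical and are not part of `pfd.op`.

```python
from pathlib import Path
from typing import Dict, List


def pair_tasks(task_names: List[str], task_paths: List[Path]) -> Dict[str, Path]:
    """Hypothetical helper: map each task name to the path at the same index."""
    if len(task_names) != len(task_paths):
        raise ValueError("task_names and task_paths must have the same length")
    if len(set(task_names)) != len(task_names):
        raise ValueError("task names must be unique")
    # zip pairs entries by position, which is why a consistent order matters.
    return dict(zip(task_names, task_paths))


# Assumed naming pattern, chosen only for this example.
names = ["task.000000", "task.000001"]
paths = [Path("tasks") / name for name in names]
tasks = pair_tasks(names, paths)
print(tasks["task.000001"])
```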
pfd.op.converge module
pfd.op.inference module
pfd.op.model_test module
pfd.op.pert_gen module
pfd.op.select_confs module
- class pfd.op.select_confs.SelectConfs(*args, **kwargs)[source]
Bases:
OP
Select configurations from exploration trajectories for labeling.
- execute(ip: OPIO) → OPIO[source]
Execute the OP.
- Parameters:
ip (dict) –
Input dict with components:
conf_selector: (ConfSelector) Configuration selector.
confs: (List[str]) The exploration trajectories.
init_confs: (Artifact(List[Path])) The initial configurations.
pre_confs: (Artifact(List[Path])) The trajectories generated in the exploration.
optional_parameters: (Dict) The optional parameters
- Returns:
Output dict with components:
confa: (Artifact(Path)) The selected configurations.
- Return type:
Any
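To make the shape of the input dict concrete, here is a toy sketch that uses the documented input keys with a plain `dict` and a stand-in selector. It does not call `SelectConfs` or the real `ConfSelector`; the function names `every_other` and `toy_select`, the file names, and the output key `selected` are all hypothetical.

```python
from pathlib import Path
from typing import Any, Dict, List


def every_other(trajs: List[str]) -> List[str]:
    """Stand-in selector (hypothetical): keep every second trajectory."""
    return trajs[::2]


def toy_select(ip: Dict[str, Any]) -> Dict[str, List[str]]:
    """Apply the selector stored under 'conf_selector' to the 'confs' entry."""
    selector = ip["conf_selector"]
    return {"selected": selector(ip["confs"])}


# Input keys follow the parameter list above; all values are placeholders.
ip = {
    "conf_selector": every_other,
    "confs": ["traj_0.dump", "traj_1.dump", "traj_2.dump"],
    "init_confs": [Path("init_0.extxyz")],
    "pre_confs": [],
    "optional_parameters": {},
}
out = toy_select(ip)
print(out["selected"])
```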
pfd.op.stage module
- class pfd.op.stage.StageSchedulerDist[source]
Bases:
StageScheduler
- class pfd.op.stage.StageSchedulerFT[source]
Bases:
StageScheduler