smftools.informatics.pod5_functions
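Functions
| basecall_pod5s(config_path) | Basecall POD5 inputs using a configuration file. |
| fast5_to_pod5(fast5_dir, output_pod5='FAST5s_to_POD5.pod5') | Convert FAST5 inputs into a single POD5 file. |
| subsample_pod5(pod5_path, read_name_path, output_directory) | Write a subsampled POD5 containing selected reads. |
| subsample_pod5_for_basecalling(input_path, max_reads, output_dir, seed=42) | Randomly sample up to max_reads reads from pod5 inputs and write a temp pod5. |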
- smftools.informatics.pod5_functions.basecall_pod5s(config_path)
Basecall POD5 inputs using a configuration file.
- smftools.informatics.pod5_functions.fast5_to_pod5(fast5_dir, output_pod5='FAST5s_to_POD5.pod5')
Convert FAST5 inputs into a single POD5 file.
- Parameters:
- Raises:
FileNotFoundError -- If no FAST5 files are found or the input path is invalid.
- Return type:
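The FileNotFoundError condition described above can be illustrated with a minimal sketch. The helper name find_fast5_files, the ".fast5" suffix check, and the choice to accept either a single file or a directory are assumptions made for illustration; they are not taken from the smftools source.

```python
from pathlib import Path


def find_fast5_files(fast5_dir):
    """Hypothetical helper: list FAST5 files or raise FileNotFoundError."""
    path = Path(fast5_dir)
    # An invalid (non-existent) input path is reported first.
    if not path.exists():
        raise FileNotFoundError(f"Input path does not exist: {path}")
    # Assumption: the input may be a single file or a directory of files.
    if path.is_file():
        candidates = [path]
    else:
        candidates = sorted(path.glob("*.fast5"))
    files = [p for p in candidates if p.suffix.lower() == ".fast5"]
    # An existing path that contains no FAST5 files is also an error.
    if not files:
        raise FileNotFoundError(f"No FAST5 files found in: {path}")
    return files
```

A caller that wants to fail early could run a check like this before attempting the conversion, so that a wrong directory is reported before any output file is created.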
- smftools.informatics.pod5_functions.subsample_pod5(pod5_path, read_name_path, output_directory)
Write a subsampled POD5 containing selected reads.
- Parameters:
- Return type:
- smftools.informatics.pod5_functions.subsample_pod5_for_basecalling(input_path, max_reads, output_dir, seed=42)
Randomly sample up to max_reads reads from pod5 inputs and write a temp pod5.
Read IDs are collected from all POD5 files first (memory-efficient), then sampled, and only the selected reads are written. If the total read count is already <= max_reads, the original input_path is returned unchanged.
- Parameters:
- Return type:
- Returns:
Path to the (possibly new) pod5 file to use for basecalling.
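The collect, sample, write order described for subsample_pod5_for_basecalling can be sketched in plain Python. This covers only the selection step. The helper name select_read_ids and the dictionary mapping file names to read IDs are hypothetical stand-ins for whatever the function gathers from the POD5 files, and returning None is just this sketch's way of signalling that the original input can be used as-is.

```python
import random


def select_read_ids(read_ids_by_file, max_reads, seed=42):
    """Hypothetical helper: return sampled read IDs, or None if no subsampling is needed."""
    # Collect only the IDs from every file; no signal data is held in memory.
    all_ids = [rid for ids in read_ids_by_file.values() for rid in ids]
    # Already within the limit: the caller can keep using the original input.
    if len(all_ids) <= max_reads:
        return None
    # A seeded generator makes the selection reproducible across runs.
    rng = random.Random(seed)
    return set(rng.sample(all_ids, max_reads))


# Illustrative read IDs, not real data.
example_ids = {"a.pod5": ["r1", "r2", "r3"], "b.pod5": ["r4", "r5"]}
selected = select_read_ids(example_ids, max_reads=2)
```

With the selection in hand, a writer only has to copy the reads whose IDs are in the returned set, which is why the IDs are gathered before any read is written.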