Data Preprocessing
calculate_class_weights
calculate_class_weights (dataloader, ignore_index=-100, returns_padded_mask=True, return_ratio=True)
interpolate_nan_clip
interpolate_nan_clip (x_in, physiological_range_clip=None, percentile_clip=None, return_mask_only=False)
Function to clip outliers based on percentiles or a physiological range and then interpolate over the clipped values using nearby values
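The clip-then-interpolate idea described above can be sketched roughly as follows. This is a minimal illustration, not the library's implementation: `clip_and_interpolate_sketch` and its parameters `valid_range` and `percentiles` are hypothetical names, and the sketch assumes a 1-D signal, linear interpolation, and that out-of-range values and NaNs are both treated as gaps to fill.

```python
import numpy as np

def clip_and_interpolate_sketch(x, valid_range=None, percentiles=None):
    """Hypothetical helper: mark outliers/NaNs, then fill them linearly.

    valid_range: optional (low, high) bounds, e.g. a physiological range.
    percentiles: optional (low_pct, high_pct) bounds computed from the data.
    Returns the filled signal and the boolean mask of replaced points.
    """
    x = np.asarray(x, dtype=float).copy()
    mask = np.isnan(x)  # existing NaNs are treated as gaps

    if valid_range is not None:
        low, high = valid_range
        mask |= (x < low) | (x > high)

    if percentiles is not None:
        # nanpercentile ignores NaNs when computing the cut-offs
        low, high = np.nanpercentile(x, percentiles)
        mask |= (x < low) | (x > high)

    good = ~mask
    if good.any() and mask.any():
        idx = np.arange(x.size)
        # Linear interpolation from the nearest valid neighbours;
        # np.interp holds the edge value for gaps at either end.
        x[mask] = np.interp(idx[mask], idx[good], x[good])
    return x, mask

signal = [1.0, 2.0, 500.0, 4.0, float("nan"), 6.0]
filled, replaced = clip_and_interpolate_sketch(signal, valid_range=(0, 100))
```

Returning the mask alongside the filled signal loosely mirrors the `return_mask_only` option in the signature above, although how the real function uses that flag is not documented here.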
calculate_stats_all
calculate_stats_all (zarr_files, channels, sample_wise=True, clip_interpolations=None, channel_magnitude_multiple=None)
calculate_stats
calculate_stats (idx, zarr_file, channels, clip_interpolations=None, channel_magnitude_multiple=None)
Function to calculate stats on an individual zarr array, including a clip interpolate range
calculate_samples_mp
calculate_samples_mp (zarr_files, channels, frequency, sample_seq_len_sec, stride_sec, start_offset_sec=None, max_seq_len_sec=None, include_partial_samples=True, nan_tolerance=0.0)
Multiprocessing function to generate samples
calculate_samples
calculate_samples (idx, zarr_file, channels, frequency, sample_seq_len_sec, stride_sec, start_offset_sec=None, max_seq_len_sec=None, include_partial_samples=True, nan_tolerance=0.0)
Function to create a dataframe of samples and their sequence indices
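As a rough illustration of how sequence indices might be derived from `frequency`, `sample_seq_len_sec`, and `stride_sec`, here is a minimal sliding-window sketch. The function `window_indices_sketch`, the `n_points` argument, and the `start_idx`/`end_idx` keys are hypothetical; the real function reads zarr files, applies `max_seq_len_sec` and `nan_tolerance`, and returns a dataframe, none of which is reproduced here.

```python
def window_indices_sketch(n_points, frequency, sample_seq_len_sec, stride_sec,
                          start_offset_sec=0, include_partial_samples=True):
    """Hypothetical helper: list (start, end) index pairs for fixed-length windows.

    n_points is the number of data points in one channel (an assumption;
    the documented function takes zarr files instead).
    """
    window = int(sample_seq_len_sec * frequency)  # window length in points
    step = int(stride_sec * frequency)            # stride in points
    if window <= 0 or step <= 0:
        raise ValueError("window length and stride must be positive")

    rows = []
    start = int(start_offset_sec * frequency)
    while start < n_points:
        end = start + window
        if end > n_points:
            if not include_partial_samples:
                break          # drop the trailing, shorter window
            end = n_points     # keep it, truncated to the data length
        rows.append({"start_idx": start, "end_idx": end})
        start += step
    return rows

# 25 s of data at 10 Hz, cut into 10 s windows with a 10 s stride
rows = window_indices_sketch(250, frequency=10, sample_seq_len_sec=10, stride_sec=10)
```

A list of dicts like this can be passed directly to `pandas.DataFrame(rows)` if a dataframe of samples is wanted.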