pvx Algorithm Limitations and Applicability

May 25, 2026


Generated from commit 35e9761 (commit date: 2026-05-25T08:14:42-04:00).

This document summarizes assumptions, likely failure modes, and practical exclusion cases for each algorithm group and algorithm module.

Group-Level Summary

| Group | Assumptions | Failure Modes | When Not To Use |
| --- | --- | --- | --- |
| analysis_qa_and_automation | Feature extraction settings align with domain (speech vs music etc.). | False positives/negatives under domain shift. | Avoid treating single metrics as absolute quality verdicts. |
| creative_spectral_effects | Spectral manipulations are desired even with timbral coloration. | Can introduce intentional but strong coloration or temporal artifacts. | Avoid for transparent restoration/mastering paths. |
| denoise_and_restoration | Noise/artifacts are distinguishable from desired signal statistics. | Over-reduction can remove detail and create modulation artifacts. | Avoid high reduction settings on sparse acoustic sources without auditioning. |
| dereverb_and_room_correction | Late reverberation is separable from direct content under chosen model. | Speech/music clarity can drop if early reflections are over-suppressed. | Avoid strong dereverb when room character is part of artistic intent. |
| dynamics_and_loudness | Program dynamics fit compressor/limiter time constants and thresholds. | Pumping, breathing, or overs if thresholds and release are mis-set. | Avoid applying multiple strong dynamics stages without gain staging checks. |
| granular_and_modulation | Grain and modulation rates are musically matched to source texture. | Incoherent grain scheduling can produce choppiness or blur. | Avoid dense granular settings on speech intelligibility-critical content. |
| pitch_detection_and_tracking | F0 evidence is strong in the selected analysis band and frame size. | Octave errors and voicing flips under heavy noise/polyphony. | Avoid as the sole control signal for dense polyphonic mixtures. |
| retune_and_intonation | Detected notes map cleanly to intended tonal center/scale. | Over-correction can flatten expressive vibrato or slides. | Avoid aggressive correction when preserving natural micro-intonation is required. |
| separation_and_decomposition | Sources have partially separable spectral or statistical structure. | Component bleeding and musical noise under overlap or model mismatch. | Avoid expecting perfect stems from strongly correlated or co-modulated sources. |
| spatial_and_multichannel | Channel geometry/order and timing metadata are correct. | Spatial collapse, combing, or localization bias from misalignment. | Avoid blind spatial processing when channel order/calibration is unknown. |
| spectral_time_frequency_transforms | Transform parameterization matches target time-frequency structure. | Incorrect parameterization can smear events or over-fragment spectra. | Avoid default settings for highly nonstationary signals without tuning. |
| time_scale_and_pitch_core | Frames are locally quasi-stationary and harmonic evolution is reasonably smooth. | High-ratio stretch can introduce phasiness and blurred transients. | Avoid for extreme percussive-only material when attack realism is critical. |

analysis_qa_and_automation

| Algorithm ID | Assumptions | Failure Modes | When Not To Use |
| --- | --- | --- | --- |
| analysis_qa_and_automation.auto_parameter_tuning_bayesian_optimization | Feature extraction settings align with domain (speech vs music etc.). | False positives/negatives under domain shift. | Avoid treating single metrics as absolute quality verdicts. |
| analysis_qa_and_automation.batch_preset_recommendation_based_on_source_features | Feature extraction settings align with domain (speech vs music etc.). | False positives/negatives under domain shift. | Avoid treating single metrics as absolute quality verdicts. |
| analysis_qa_and_automation.clip_hum_buzz_artifact_detection | Feature extraction settings align with domain (speech vs music etc.). | False positives/negatives under domain shift. | Avoid treating single metrics as absolute quality verdicts. |
| analysis_qa_and_automation.key_chord_detection | Feature extraction settings align with domain (speech vs music etc.). | False positives/negatives under domain shift. | Avoid treating single metrics as absolute quality verdicts. |
| analysis_qa_and_automation.onset_beat_downbeat_tracking | Feature extraction settings align with domain (speech vs music etc.). | False positives/negatives under domain shift. | Avoid treating single metrics as absolute quality verdicts. |
| analysis_qa_and_automation.pesq_stoi_visqol_quality_metrics | Feature extraction settings align with domain (speech vs music etc.). | False positives/negatives under domain shift. | Avoid treating single metrics as absolute quality verdicts. |
| analysis_qa_and_automation.silence_speech_music_classifiers | Feature extraction settings align with domain (speech vs music etc.). | False positives/negatives under domain shift. | Avoid treating single metrics as absolute quality verdicts. |
| analysis_qa_and_automation.structure_segmentation_verse_chorus_sections | Feature extraction settings align with domain (speech vs music etc.). | False positives/negatives under domain shift. | Avoid treating single metrics as absolute quality verdicts. |

creative_spectral_effects

| Algorithm ID | Assumptions | Failure Modes | When Not To Use |
| --- | --- | --- | --- |
| creative_spectral_effects.cross_synthesis_vocoder | Spectral manipulations are desired even with timbral coloration. | Can introduce intentional but strong coloration or temporal artifacts. | Avoid for transparent restoration/mastering paths. |
| creative_spectral_effects.formant_painting_warping | Spectral manipulations are desired even with timbral coloration. | Can introduce intentional but strong coloration or temporal artifacts. | Avoid for transparent restoration/mastering paths. |
| creative_spectral_effects.phase_randomization_textures | Spectral manipulations are desired even with timbral coloration. | Can introduce intentional but strong coloration or temporal artifacts. | Avoid for transparent restoration/mastering paths. |
| creative_spectral_effects.resonator_filterbank_morphing | Spectral manipulations are desired even with timbral coloration. | Can introduce intentional but strong coloration or temporal artifacts. | Avoid for transparent restoration/mastering paths. |
| creative_spectral_effects.spectral_blur_smear | Spectral manipulations are desired even with timbral coloration. | Can introduce intentional but strong coloration or temporal artifacts. | Avoid for transparent restoration/mastering paths. |
| creative_spectral_effects.spectral_contrast_exaggeration | Spectral manipulations are desired even with timbral coloration. | Can introduce intentional but strong coloration or temporal artifacts. | Avoid for transparent restoration/mastering paths. |
| creative_spectral_effects.spectral_convolution_effects | Spectral manipulations are desired even with timbral coloration. | Can introduce intentional but strong coloration or temporal artifacts. | Avoid for transparent restoration/mastering paths. |
| creative_spectral_effects.spectral_freeze_banks | Spectral manipulations are desired even with timbral coloration. | Can introduce intentional but strong coloration or temporal artifacts. | Avoid for transparent restoration/mastering paths. |

denoise_and_restoration

| Algorithm ID | Assumptions | Failure Modes | When Not To Use |
| --- | --- | --- | --- |
| denoise_and_restoration.declick_decrackle_median_wavelet_interpolation | Noise/artifacts are distinguishable from desired signal statistics. Noise model should be representative of observed noise floor. | Over-reduction can remove detail and create modulation artifacts. Mismatched noise estimate leaves residue or damages detail. | Avoid high reduction settings on sparse acoustic sources without auditioning. Avoid static settings on rapidly varying nonstationary noise. |
| denoise_and_restoration.declip_via_sparse_reconstruction | Noise/artifacts are distinguishable from desired signal statistics. Noise model should be representative of observed noise floor. | Over-reduction can remove detail and create modulation artifacts. Mismatched noise estimate leaves residue or damages detail. | Avoid high reduction settings on sparse acoustic sources without auditioning. Avoid static settings on rapidly varying nonstationary noise. |
| denoise_and_restoration.diffusion_based_speech_audio_denoise | Noise/artifacts are distinguishable from desired signal statistics. Noise model should be representative of observed noise floor. | Over-reduction can remove detail and create modulation artifacts. Mismatched noise estimate leaves residue or damages detail. | Avoid high reduction settings on sparse acoustic sources without auditioning. Avoid static settings on rapidly varying nonstationary noise. |
| denoise_and_restoration.log_mmse | Noise/artifacts are distinguishable from desired signal statistics. Noise model should be representative of observed noise floor. | Over-reduction can remove detail and create modulation artifacts. Mismatched noise estimate leaves residue or damages detail. | Avoid high reduction settings on sparse acoustic sources without auditioning. Avoid static settings on rapidly varying nonstationary noise. |
| denoise_and_restoration.minimum_statistics_noise_tracking | Noise/artifacts are distinguishable from desired signal statistics. Noise model should be representative of observed noise floor. | Over-reduction can remove detail and create modulation artifacts. Mismatched noise estimate leaves residue or damages detail. | Avoid high reduction settings on sparse acoustic sources without auditioning. Avoid static settings on rapidly varying nonstationary noise. |
| denoise_and_restoration.mmse_stsa | Noise/artifacts are distinguishable from desired signal statistics. Noise model should be representative of observed noise floor. | Over-reduction can remove detail and create modulation artifacts. Mismatched noise estimate leaves residue or damages detail. | Avoid high reduction settings on sparse acoustic sources without auditioning. Avoid static settings on rapidly varying nonstationary noise. |
| denoise_and_restoration.rnnoise_style_denoiser | Noise/artifacts are distinguishable from desired signal statistics. Noise model should be representative of observed noise floor. | Over-reduction can remove detail and create modulation artifacts. Mismatched noise estimate leaves residue or damages detail. | Avoid high reduction settings on sparse acoustic sources without auditioning. Avoid static settings on rapidly varying nonstationary noise. |
| denoise_and_restoration.wiener_denoising | Noise/artifacts are distinguishable from desired signal statistics. Noise model should be representative of observed noise floor. | Over-reduction can remove detail and create modulation artifacts. Mismatched noise estimate leaves residue or damages detail. | Avoid high reduction settings on sparse acoustic sources without auditioning. Avoid static settings on rapidly varying nonstationary noise. |
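
The over-reduction failure mode listed above is often limited in practice by putting a floor under the suppression gain. The following is a minimal sketch under stated assumptions, not pvx code: `wiener_gain`, `observed_power`, `noise_power`, and `gain_floor` are hypothetical names, and the rule shown is the textbook Wiener gain computed for a single frequency bin from an observed (noisy) power and a noise power estimate.

```python
def wiener_gain(observed_power: float, noise_power: float,
                gain_floor: float = 0.1) -> float:
    """Per-bin Wiener-style gain with a floor (hypothetical helper, not a pvx API)."""
    if noise_power <= 0.0:
        # No noise estimate: pass the bin through unchanged.
        return 1.0
    # Estimate clean-signal power by power subtraction, clamped at zero.
    clean_power = max(observed_power - noise_power, 0.0)
    snr = clean_power / noise_power
    # Textbook Wiener gain: snr / (1 + snr).
    gain = snr / (1.0 + snr)
    # The floor bounds the maximum attenuation to limit over-reduction.
    return max(gain, gain_floor)
```

A higher `gain_floor` leaves more residual noise but removes less detail. If the noise estimate is mismatched, the gain is wrong regardless of the floor, which is the second failure mode in the table.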

dereverb_and_room_correction

| Algorithm ID | Assumptions | Failure Modes | When Not To Use |
| --- | --- | --- | --- |
| dereverb_and_room_correction.blind_deconvolution_dereverb | Late reverberation is separable from direct content under chosen model. Reverberation tail is assumed more diffuse than direct content. | Speech/music clarity can drop if early reflections are over-suppressed. Over-suppression can thin tonal body and ambience. | Avoid strong dereverb when room character is part of artistic intent. Avoid for intentionally wet effects unless mix preservation is planned. |
| dereverb_and_room_correction.drr_guided_dereverb | Late reverberation is separable from direct content under chosen model. Reverberation tail is assumed more diffuse than direct content. | Speech/music clarity can drop if early reflections are over-suppressed. Over-suppression can thin tonal body and ambience. | Avoid strong dereverb when room character is part of artistic intent. Avoid for intentionally wet effects unless mix preservation is planned. |
| dereverb_and_room_correction.late_reverb_suppression_via_coherence | Late reverberation is separable from direct content under chosen model. Reverberation tail is assumed more diffuse than direct content. | Speech/music clarity can drop if early reflections are over-suppressed. Over-suppression can thin tonal body and ambience. | Avoid strong dereverb when room character is part of artistic intent. Avoid for intentionally wet effects unless mix preservation is planned. |
| dereverb_and_room_correction.multi_band_adaptive_deverb | Late reverberation is separable from direct content under chosen model. Reverberation tail is assumed more diffuse than direct content. | Speech/music clarity can drop if early reflections are over-suppressed. Over-suppression can thin tonal body and ambience. | Avoid strong dereverb when room character is part of artistic intent. Avoid for intentionally wet effects unless mix preservation is planned. |
| dereverb_and_room_correction.neural_dereverb_module | Late reverberation is separable from direct content under chosen model. Model priors assume training-like signal statistics. Reverberation tail is assumed more diffuse than direct content. | Speech/music clarity can drop if early reflections are over-suppressed. Generalization gaps can produce unstable artifacts. Over-suppression can thin tonal body and ambience. | Avoid strong dereverb when room character is part of artistic intent. Avoid fully unattended use on out-of-domain material. Avoid for intentionally wet effects unless mix preservation is planned. |
| dereverb_and_room_correction.room_impulse_inverse_filtering | Late reverberation is separable from direct content under chosen model. Reverberation tail is assumed more diffuse than direct content. | Speech/music clarity can drop if early reflections are over-suppressed. Over-suppression can thin tonal body and ambience. | Avoid strong dereverb when room character is part of artistic intent. Avoid for intentionally wet effects unless mix preservation is planned. |
| dereverb_and_room_correction.spectral_decay_subtraction | Late reverberation is separable from direct content under chosen model. Reverberation tail is assumed more diffuse than direct content. | Speech/music clarity can drop if early reflections are over-suppressed. Over-suppression can thin tonal body and ambience. | Avoid strong dereverb when room character is part of artistic intent. Avoid for intentionally wet effects unless mix preservation is planned. |
| dereverb_and_room_correction.wpe_dereverberation | Late reverberation is separable from direct content under chosen model. Reverberation tail is assumed more diffuse than direct content. | Speech/music clarity can drop if early reflections are over-suppressed. Over-suppression can thin tonal body and ambience. | Avoid strong dereverb when room character is part of artistic intent. Avoid for intentionally wet effects unless mix preservation is planned. |

dynamics_and_loudness

| Algorithm ID | Assumptions | Failure Modes | When Not To Use |
| --- | --- | --- | --- |
| dynamics_and_loudness.ebu_r128_normalization | Program dynamics fit compressor/limiter time constants and thresholds. | Pumping, breathing, or overs if thresholds and release are mis-set. | Avoid applying multiple strong dynamics stages without gain staging checks. |
| dynamics_and_loudness.itu_bs_1770_loudness_measurement_gating | Program dynamics fit compressor/limiter time constants and thresholds. | Pumping, breathing, or overs if thresholds and release are mis-set. | Avoid applying multiple strong dynamics stages without gain staging checks. |
| dynamics_and_loudness.lufs_target_mastering_chain | Program dynamics fit compressor/limiter time constants and thresholds. | Pumping, breathing, or overs if thresholds and release are mis-set. | Avoid applying multiple strong dynamics stages without gain staging checks. |
| dynamics_and_loudness.multi_band_compression | Program dynamics fit compressor/limiter time constants and thresholds. | Pumping, breathing, or overs if thresholds and release are mis-set. | Avoid applying multiple strong dynamics stages without gain staging checks. |
| dynamics_and_loudness.spectral_dynamics_bin_wise_compressor_expander | Program dynamics fit compressor/limiter time constants and thresholds. | Pumping, breathing, or overs if thresholds and release are mis-set. | Avoid applying multiple strong dynamics stages without gain staging checks. |
| dynamics_and_loudness.transient_shaping | Program dynamics fit compressor/limiter time constants and thresholds. | Pumping, breathing, or overs if thresholds and release are mis-set. | Avoid applying multiple strong dynamics stages without gain staging checks. |
| dynamics_and_loudness.true_peak_limiting | Program dynamics fit compressor/limiter time constants and thresholds. | Pumping, breathing, or overs if thresholds and release are mis-set. | Avoid applying multiple strong dynamics stages without gain staging checks. |
| dynamics_and_loudness.upward_compression | Program dynamics fit compressor/limiter time constants and thresholds. | Pumping, breathing, or overs if thresholds and release are mis-set. | Avoid applying multiple strong dynamics stages without gain staging checks. |
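
A gain staging check of the kind recommended above can be as simple as measuring the peak level after each stage. The sketch below is illustrative only and is not part of pvx: `peak_dbfs`, `check_gain_staging`, and the `-1.0` dBFS ceiling are hypothetical choices. It measures sample peak on floating-point audio normalized to ±1.0, not true peak, which would require oversampling.

```python
import math

def peak_dbfs(samples):
    """Sample-peak level in dBFS for float audio normalized to +/-1.0."""
    peak = max((abs(s) for s in samples), default=0.0)
    return -math.inf if peak == 0.0 else 20.0 * math.log10(peak)

def check_gain_staging(stage_outputs, ceiling_dbfs=-1.0):
    """Return the names of stages whose output peak exceeds the ceiling.

    stage_outputs is a list of (stage_name, samples) pairs in chain order.
    This is a hypothetical helper, not a pvx API.
    """
    return [name for name, samples in stage_outputs
            if peak_dbfs(samples) > ceiling_dbfs]

# Usage: inspect the output of each stage in a two-stage chain.
chain = [("compressor", [0.5, -0.4]), ("limiter", [0.99, -0.2])]
hot_stages = check_gain_staging(chain)
```

Checking every stage rather than only the final output shows which stage introduces the overs.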

granular_and_modulation

| Algorithm ID | Assumptions | Failure Modes | When Not To Use |
| --- | --- | --- | --- |
| granular_and_modulation.am_fm_ring_modulation_blocks | Grain and modulation rates are musically matched to source texture. | Incoherent grain scheduling can produce choppiness or blur. | Avoid dense granular settings on speech intelligibility-critical content. |
| granular_and_modulation.envelope_followed_modulation_routing | Grain and modulation rates are musically matched to source texture. | Incoherent grain scheduling can produce choppiness or blur. | Avoid dense granular settings on speech intelligibility-critical content. |
| granular_and_modulation.formant_lfo_modulation | Grain and modulation rates are musically matched to source texture. | Incoherent grain scheduling can produce choppiness or blur. | Avoid dense granular settings on speech intelligibility-critical content. |
| granular_and_modulation.freeze_grain_morphing | Grain and modulation rates are musically matched to source texture. | Incoherent grain scheduling can produce choppiness or blur. | Avoid dense granular settings on speech intelligibility-critical content. |
| granular_and_modulation.grain_cloud_pitch_textures | Grain and modulation rates are musically matched to source texture. | Incoherent grain scheduling can produce choppiness or blur. | Avoid dense granular settings on speech intelligibility-critical content. |
| granular_and_modulation.granular_time_stretch_engine | Grain and modulation rates are musically matched to source texture. | Incoherent grain scheduling can produce choppiness or blur. | Avoid dense granular settings on speech intelligibility-critical content. |
| granular_and_modulation.rhythmic_gate_stutter_quantizer | Grain and modulation rates are musically matched to source texture. | Incoherent grain scheduling can produce choppiness or blur. | Avoid dense granular settings on speech intelligibility-critical content. |
| granular_and_modulation.spectral_tremolo | Grain and modulation rates are musically matched to source texture. | Incoherent grain scheduling can produce choppiness or blur. | Avoid dense granular settings on speech intelligibility-critical content. |

pitch_detection_and_tracking

| Algorithm ID | Assumptions | Failure Modes | When Not To Use |
| --- | --- | --- | --- |
| pitch_detection_and_tracking.crepe_style_neural_f0 | F0 evidence is strong in the selected analysis band and frame size. Model priors assume training-like signal statistics. | Octave errors and voicing flips under heavy noise/polyphony. Generalization gaps can produce unstable artifacts. | Avoid as the sole control signal for dense polyphonic mixtures. Avoid fully unattended use on out-of-domain material. |
| pitch_detection_and_tracking.harmonic_product_spectrum_hps | F0 evidence is strong in the selected analysis band and frame size. | Octave errors and voicing flips under heavy noise/polyphony. | Avoid as the sole control signal for dense polyphonic mixtures. |
| pitch_detection_and_tracking.pyin | F0 evidence is strong in the selected analysis band and frame size. | Octave errors and voicing flips under heavy noise/polyphony. | Avoid as the sole control signal for dense polyphonic mixtures. |
| pitch_detection_and_tracking.rapt | F0 evidence is strong in the selected analysis band and frame size. | Octave errors and voicing flips under heavy noise/polyphony. | Avoid as the sole control signal for dense polyphonic mixtures. |
| pitch_detection_and_tracking.subharmonic_summation | F0 evidence is strong in the selected analysis band and frame size. | Octave errors and voicing flips under heavy noise/polyphony. | Avoid as the sole control signal for dense polyphonic mixtures. |
| pitch_detection_and_tracking.swipe | F0 evidence is strong in the selected analysis band and frame size. | Octave errors and voicing flips under heavy noise/polyphony. | Avoid as the sole control signal for dense polyphonic mixtures. |
| pitch_detection_and_tracking.viterbi_smoothed_pitch_contour_tracking | F0 evidence is strong in the selected analysis band and frame size. | Octave errors and voicing flips under heavy noise/polyphony. | Avoid as the sole control signal for dense polyphonic mixtures. |
| pitch_detection_and_tracking.yin | F0 evidence is strong in the selected analysis band and frame size. | Octave errors and voicing flips under heavy noise/polyphony. | Avoid as the sole control signal for dense polyphonic mixtures. |
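
Octave errors of the kind listed above can be screened for before an F0 track is used as a control signal. The following sketch is an assumption-laden illustration, not a pvx function: `flag_octave_jumps` and its `tolerance_cents` parameter are hypothetical, and unvoiced frames are assumed to be encoded as `None` or a non-positive value.

```python
import math

def flag_octave_jumps(f0_hz, tolerance_cents=50.0):
    """Return frame indices where F0 moves by roughly one octave
    relative to the previous voiced frame (hypothetical helper)."""
    flagged = []
    previous = None
    for index, f0 in enumerate(f0_hz):
        if f0 is None or f0 <= 0.0:
            # Unvoiced frame: skip it and keep the last voiced value.
            continue
        if previous is not None:
            cents = 1200.0 * math.log2(f0 / previous)
            # An octave is 1200 cents; accept jumps up or down.
            if abs(abs(cents) - 1200.0) <= tolerance_cents:
                flagged.append(index)
        previous = f0
    return flagged

# Usage: a single-frame octave error is flagged on entry and on exit.
track = [220.0, 221.0, 440.0, 222.0, 0.0, 223.0]
suspect_frames = flag_octave_jumps(track)
```

A flagged frame is only a candidate: genuine octave leaps in the music produce the same pattern, so the result is better treated as a prompt for review than as an automatic correction.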

retune_and_intonation

| Algorithm ID | Assumptions | Failure Modes | When Not To Use |
| --- | --- | --- | --- |
| retune_and_intonation.adaptive_intonation_context_sensitive_intervals | Detected notes map cleanly to intended tonal center/scale. Pitch trajectory estimates should be continuous enough for retuning. | Over-correction can flatten expressive vibrato or slides. Fast F0 jumps can cause audible stepping. | Avoid aggressive correction when preserving natural micro-intonation is required. Avoid high-strength retune on breath/noise segments. |
| retune_and_intonation.chord_aware_retuning | Detected notes map cleanly to intended tonal center/scale. Pitch trajectory estimates should be continuous enough for retuning. | Over-correction can flatten expressive vibrato or slides. Fast F0 jumps can cause audible stepping. | Avoid aggressive correction when preserving natural micro-intonation is required. Avoid high-strength retune on breath/noise segments. |
| retune_and_intonation.just_intonation_mapping_per_key_center | Detected notes map cleanly to intended tonal center/scale. Pitch trajectory estimates should be continuous enough for retuning. | Over-correction can flatten expressive vibrato or slides. Fast F0 jumps can cause audible stepping. | Avoid aggressive correction when preserving natural micro-intonation is required. Avoid high-strength retune on breath/noise segments. |
| retune_and_intonation.key_aware_retuning_with_confidence_weighting | Detected notes map cleanly to intended tonal center/scale. Pitch trajectory estimates should be continuous enough for retuning. | Over-correction can flatten expressive vibrato or slides. Fast F0 jumps can cause audible stepping. | Avoid aggressive correction when preserving natural micro-intonation is required. Avoid high-strength retune on breath/noise segments. |
| retune_and_intonation.portamento_aware_retune_curves | Detected notes map cleanly to intended tonal center/scale. Pitch trajectory estimates should be continuous enough for retuning. | Over-correction can flatten expressive vibrato or slides. Fast F0 jumps can cause audible stepping. | Avoid aggressive correction when preserving natural micro-intonation is required. Avoid high-strength retune on breath/noise segments. |
| retune_and_intonation.scala_mts_scale_import_and_quantization | Detected notes map cleanly to intended tonal center/scale. Pitch trajectory estimates should be continuous enough for retuning. | Over-correction can flatten expressive vibrato or slides. Fast F0 jumps can cause audible stepping. | Avoid aggressive correction when preserving natural micro-intonation is required. Avoid high-strength retune on breath/noise segments. |
| retune_and_intonation.time_varying_cents_maps | Detected notes map cleanly to intended tonal center/scale. Pitch trajectory estimates should be continuous enough for retuning. | Over-correction can flatten expressive vibrato or slides. Fast F0 jumps can cause audible stepping. | Avoid aggressive correction when preserving natural micro-intonation is required. Avoid high-strength retune on breath/noise segments. |
| retune_and_intonation.vibrato_preserving_correction | Detected notes map cleanly to intended tonal center/scale. Pitch trajectory estimates should be continuous enough for retuning. | Over-correction can flatten expressive vibrato or slides. Fast F0 jumps can cause audible stepping. | Avoid aggressive correction when preserving natural micro-intonation is required. Avoid high-strength retune on breath/noise segments. |
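
The trade-off between correction and preserved micro-intonation can be made concrete with a strength parameter. The sketch below is a simplified illustration, not the pvx implementation: `retune_hz` is a hypothetical helper that assumes a 12-tone equal-tempered target grid referenced to 440 Hz, whereas the modules above may target other scales or tunings.

```python
import math

def retune_hz(f0_hz, strength=1.0, reference_hz=440.0):
    """Pull a frequency toward the nearest 12-TET note.

    strength=0.0 leaves the input untouched; strength=1.0 snaps fully.
    Hypothetical helper assuming an equal-tempered grid, not a pvx API.
    """
    if f0_hz <= 0.0:
        # Unvoiced or invalid frame: nothing to correct.
        return f0_hz
    semitones = 12.0 * math.log2(f0_hz / reference_hz)
    # Signed deviation from the nearest grid note, in semitones.
    deviation = semitones - round(semitones)
    # Remove only the requested fraction of the deviation.
    return f0_hz * 2.0 ** (-strength * deviation / 12.0)
```

With `strength` below 1.0, part of each deviation survives, so vibrato and slides are reduced in depth rather than flattened. At full strength every frame lands on the grid, which is the over-correction case named in the table.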

separation_and_decomposition

| Algorithm ID | Assumptions | Failure Modes | When Not To Use |
| --- | --- | --- | --- |
| separation_and_decomposition.demucs_style_stem_separation_backend | Sources have partially separable spectral or statistical structure. | Component bleeding and musical noise under overlap or model mismatch. | Avoid expecting perfect stems from strongly correlated or co-modulated sources. |
| separation_and_decomposition.ica_bss_for_multichannel_stems | Sources have partially separable spectral or statistical structure. | Component bleeding and musical noise under overlap or model mismatch. | Avoid expecting perfect stems from strongly correlated or co-modulated sources. |
| separation_and_decomposition.nmf_decomposition | Sources have partially separable spectral or statistical structure. | Component bleeding and musical noise under overlap or model mismatch. | Avoid expecting perfect stems from strongly correlated or co-modulated sources. |
| separation_and_decomposition.probabilistic_latent_component_separation | Sources have partially separable spectral or statistical structure. | Component bleeding and musical noise under overlap or model mismatch. | Avoid expecting perfect stems from strongly correlated or co-modulated sources. |
| separation_and_decomposition.rpca_hpss | Sources have partially separable spectral or statistical structure. | Component bleeding and musical noise under overlap or model mismatch. | Avoid expecting perfect stems from strongly correlated or co-modulated sources. |
| separation_and_decomposition.sinusoidal_residual_transient_decomposition | Sources have partially separable spectral or statistical structure. | Component bleeding and musical noise under overlap or model mismatch. | Avoid expecting perfect stems from strongly correlated or co-modulated sources. |
| separation_and_decomposition.tensor_decomposition_cp_tucker | Sources have partially separable spectral or statistical structure. | Component bleeding and musical noise under overlap or model mismatch. | Avoid expecting perfect stems from strongly correlated or co-modulated sources. |
| separation_and_decomposition.u_net_vocal_accompaniment_split | Sources have partially separable spectral or statistical structure. | Component bleeding and musical noise under overlap or model mismatch. | Avoid expecting perfect stems from strongly correlated or co-modulated sources. |

spatial_and_multichannel

Algorithm IDAssumptionsFailure ModesWhen Not To Use
spatial_and_multichannel.binaural_itd_ild_synthesisChannel geometry/order and timing metadata are correct.Spatial collapse, combing, or localization bias from misalignment.Avoid blind spatial processing when channel order/calibration is unknown.
spatial_and_multichannel.binaural_motion_trajectory_designerChannel geometry/order and timing metadata are correct.Spatial collapse, combing, or localization bias from misalignment.Avoid blind spatial processing when channel order/calibration is unknown.
spatial_and_multichannel.coherence_based_dereverb_multichannelChannel geometry/order and timing metadata are correct. Reverberation tail is assumed more diffuse than direct content.Spatial collapse, combing, or localization bias from misalignment. Over-suppression can thin tonal body and ambience.Avoid blind spatial processing when channel order/calibration is unknown. Avoid for intentionally wet effects unless mix preservation is planned.
spatial_and_multichannel.cross_channel_click_pop_repairChannel geometry/order and timing metadata are correct.Spatial collapse, combing, or localization bias from misalignment.Avoid blind spatial processing when channel order/calibration is unknown.
spatial_and_multichannel.dbap_distance_based_amplitude_panningChannel geometry/order and timing metadata are correct.Spatial collapse, combing, or localization bias from misalignment.Avoid blind spatial processing when channel order/calibration is unknown.
spatial_and_multichannel.decorrelated_reverb_upmixChannel geometry/order and timing metadata are correct.Spatial collapse, combing, or localization bias from misalignment.Avoid blind spatial processing when channel order/calibration is unknown.
spatial_and_multichannel.microphone_array_calibration_tonesChannel geometry/order and timing metadata are correct.Spatial collapse, combing, or localization bias from misalignment.Avoid blind spatial processing when channel order/calibration is unknown.
spatial_and_multichannel.multichannel_noise_psd_trackingChannel geometry/order and timing metadata are correct.Spatial collapse, combing, or localization bias from misalignment.Avoid blind spatial processing when channel order/calibration is unknown.
spatial_and_multichannel.multichannel_wiener_postfilterChannel geometry/order and timing metadata are correct.Spatial collapse, combing, or localization bias from misalignment.Avoid blind spatial processing when channel order/calibration is unknown.
spatial_and_multichannel.phase_aligned_mid_side_field_rotationChannel geometry/order and timing metadata are correct.Spatial collapse, combing, or localization bias from misalignment.Avoid blind spatial processing when channel order/calibration is unknown.
spatial_and_multichannel.phase_consistent_multichannel_denoiseChannel geometry/order and timing metadata are correct. Noise model should be representative of observed noise floor.Spatial collapse, combing, or localization bias from misalignment. Mismatched noise estimate leaves residue or damages detail.Avoid blind spatial processing when channel order/calibration is unknown. Avoid static settings on rapidly varying nonstationary noise.
spatial_and_multichannel.pvx_directional_spectral_warpChannel geometry/order and timing metadata are correct.Spatial collapse, combing, or localization bias from misalignment.Avoid blind spatial processing when channel order/calibration is unknown.
| spatial_and_multichannel.pvx_interaural_coherence_shaping | Channel geometry/order and timing metadata are correct. | Spatial collapse, combing, or localization bias from misalignment. | Avoid blind spatial processing when channel order/calibration is unknown. |
| spatial_and_multichannel.pvx_interchannel_phase_locking | Channel geometry/order and timing metadata are correct. | Spatial collapse, combing, or localization bias from misalignment. | Avoid blind spatial processing when channel order/calibration is unknown. |
| spatial_and_multichannel.pvx_multichannel_time_alignment | Channel geometry/order and timing metadata are correct. | Spatial collapse, combing, or localization bias from misalignment. | Avoid blind spatial processing when channel order/calibration is unknown. |
| spatial_and_multichannel.pvx_spatial_freeze_and_trajectory | Channel geometry/order and timing metadata are correct. | Spatial collapse, combing, or localization bias from misalignment. | Avoid blind spatial processing when channel order/calibration is unknown. |
| spatial_and_multichannel.pvx_spatial_transient_preservation | Channel geometry/order and timing metadata are correct. | Spatial collapse, combing, or localization bias from misalignment. | Avoid blind spatial processing when channel order/calibration is unknown. |
| spatial_and_multichannel.rotating_speaker_doppler_field | Channel geometry/order and timing metadata are correct. | Spatial collapse, combing, or localization bias from misalignment. | Avoid blind spatial processing when channel order/calibration is unknown. |
| spatial_and_multichannel.spatial_freeze_resynthesis | Channel geometry/order and timing metadata are correct. | Spatial collapse, combing, or localization bias from misalignment. | Avoid blind spatial processing when channel order/calibration is unknown. |
| spatial_and_multichannel.spectral_spatial_granulator | Channel geometry/order and timing metadata are correct. | Spatial collapse, combing, or localization bias from misalignment. | Avoid blind spatial processing when channel order/calibration is unknown. |
| spatial_and_multichannel.stereo_width_frequency_dependent_control | Channel geometry/order and timing metadata are correct. | Spatial collapse, combing, or localization bias from misalignment. | Avoid blind spatial processing when channel order/calibration is unknown. |
| spatial_and_multichannel.stochastic_spatial_diffusion_cloud | Channel geometry/order and timing metadata are correct. | Spatial collapse, combing, or localization bias from misalignment. | Avoid blind spatial processing when channel order/calibration is unknown. |
| spatial_and_multichannel.transaural_crosstalk_cancellation | Channel geometry/order and timing metadata are correct. | Spatial collapse, combing, or localization bias from misalignment. | Avoid blind spatial processing when channel order/calibration is unknown. |
| spatial_and_multichannel.vbap_adaptive_panning | Channel geometry/order and timing metadata are correct. | Spatial collapse, combing, or localization bias from misalignment. | Avoid blind spatial processing when channel order/calibration is unknown. |
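
The misalignment failure mode listed for the spatial modules can be screened for before processing. The following is a minimal sketch, not part of pvx: it estimates the sample offset between two channels with a brute-force cross-correlation and flags pairs that exceed a tolerance. The function names `estimate_lag` and `needs_alignment`, and the defaults `max_lag=64` and `tolerance=1`, are hypothetical choices made for illustration.

```python
import math


def estimate_lag(reference, other, max_lag):
    """Return the lag (in samples) at which `other` best matches `reference`.

    A positive result means `other` is delayed relative to `reference`.
    Brute-force cross-correlation: fine for short excerpts, slow for long ones.
    """
    best_lag, best_score = 0, -math.inf
    n = min(len(reference), len(other))
    for lag in range(-max_lag, max_lag + 1):
        score = 0.0
        for i in range(n):
            j = i + lag
            if 0 <= j < n:
                score += reference[i] * other[j]
        # Strict comparison keeps the first lag that reaches the maximum.
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag


def needs_alignment(reference, other, max_lag=64, tolerance=1):
    """Hypothetical preflight check (not a pvx API).

    Flags a channel pair whose estimated offset exceeds `tolerance` samples.
    """
    return abs(estimate_lag(reference, other, max_lag)) > tolerance
```

A check like this only detects a constant time offset. It does not verify channel order or speaker geometry, which still have to come from reliable metadata.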

spectral_time_frequency_transforms

| Algorithm ID | Assumptions | Failure Modes | When Not To Use |
| --- | --- | --- | --- |
| spectral_time_frequency_transforms.chirplet_transform_analysis | Transform parameterization matches target time-frequency structure. | Incorrect parameterization can smear events or over-fragment spectra. | Avoid default settings for highly nonstationary signals without tuning. |
| spectral_time_frequency_transforms.constant_q_transform_cqt_processing | Transform parameterization matches target time-frequency structure. | Incorrect parameterization can smear events or over-fragment spectra. | Avoid default settings for highly nonstationary signals without tuning. |
| spectral_time_frequency_transforms.multi_window_stft_fusion | Transform parameterization matches target time-frequency structure. | Incorrect parameterization can smear events or over-fragment spectra. | Avoid default settings for highly nonstationary signals without tuning. |
| spectral_time_frequency_transforms.nsgt_based_processing | Transform parameterization matches target time-frequency structure. | Incorrect parameterization can smear events or over-fragment spectra. | Avoid default settings for highly nonstationary signals without tuning. |
| spectral_time_frequency_transforms.reassigned_spectrogram_methods | Transform parameterization matches target time-frequency structure. | Incorrect parameterization can smear events or over-fragment spectra. | Avoid default settings for highly nonstationary signals without tuning. |
| spectral_time_frequency_transforms.synchrosqueezed_stft | Transform parameterization matches target time-frequency structure. | Incorrect parameterization can smear events or over-fragment spectra. | Avoid default settings for highly nonstationary signals without tuning. |
| spectral_time_frequency_transforms.variable_q_transform_vqt | Transform parameterization matches target time-frequency structure. | Incorrect parameterization can smear events or over-fragment spectra. | Avoid default settings for highly nonstationary signals without tuning. |
| spectral_time_frequency_transforms.wavelet_packet_processing | Transform parameterization matches target time-frequency structure. | Incorrect parameterization can smear events or over-fragment spectra. | Avoid default settings for highly nonstationary signals without tuning. |
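
To make the parameterization trade-off concrete, the sketch below computes the textbook resolution figures for a fixed-window STFT and for a constant-Q analysis with geometrically spaced bins. It uses the standard relations Q = 1 / (2^(1/B) - 1) and N_k = ceil(Q * fs / f_k). It does not reflect how pvx configures its transforms, and all function names here are hypothetical.

```python
import math


def stft_resolution(sample_rate, window_length):
    """Return (bin spacing in Hz, window duration in seconds) for a fixed window."""
    return sample_rate / window_length, window_length / sample_rate


def cqt_quality_factor(bins_per_octave):
    """Q = center frequency / bandwidth for geometrically spaced bins."""
    return 1.0 / (2.0 ** (1.0 / bins_per_octave) - 1.0)


def cqt_window_lengths(sample_rate, f_min, n_bins, bins_per_octave):
    """Return a list of (center_frequency_hz, window_length_samples), one per bin.

    Low bins get long windows (fine frequency detail, coarse timing);
    high bins get short windows (coarse frequency detail, fine timing).
    """
    q = cqt_quality_factor(bins_per_octave)
    result = []
    for k in range(n_bins):
        f_k = f_min * 2.0 ** (k / bins_per_octave)
        result.append((f_k, math.ceil(q * sample_rate / f_k)))
    return result
```

Printing these values for a candidate configuration shows how long each analysis window is relative to the events in the signal. A window much longer than a transient will smear it, while one much shorter than a pitch period will fragment the spectrum.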

time_scale_and_pitch_core

| Algorithm ID | Assumptions | Failure Modes | When Not To Use |
| --- | --- | --- | --- |
| time_scale_and_pitch_core.beat_synchronous_time_warping | Frames are locally quasi-stationary and harmonic evolution is reasonably smooth. | High-ratio stretch can introduce phasiness and blurred transients. | Avoid for extreme percussive-only material when attack realism is critical. |
| time_scale_and_pitch_core.harmonic_percussive_split_tsm | Frames are locally quasi-stationary and harmonic evolution is reasonably smooth. | High-ratio stretch can introduce phasiness and blurred transients. | Avoid for extreme percussive-only material when attack realism is critical. |
| time_scale_and_pitch_core.lp_psola | Frames are locally quasi-stationary and harmonic evolution is reasonably smooth. | High-ratio stretch can introduce phasiness and blurred transients. | Avoid for extreme percussive-only material when attack realism is critical. |
| time_scale_and_pitch_core.multi_resolution_phase_vocoder | Frames are locally quasi-stationary and harmonic evolution is reasonably smooth. Phase continuity assumptions hold best for moderate stretch ratios. | High-ratio stretch can introduce phasiness and blurred transients, and the risk grows at extreme settings. | Avoid for extreme percussive-only material when attack realism is critical. Avoid very large combined stretch and pitch shifts without transient controls. |
| time_scale_and_pitch_core.nonlinear_time_maps | Frames are locally quasi-stationary and harmonic evolution is reasonably smooth. | High-ratio stretch can introduce phasiness and blurred transients. | Avoid for extreme percussive-only material when attack realism is critical. |
| time_scale_and_pitch_core.td_psola | Frames are locally quasi-stationary and harmonic evolution is reasonably smooth. | High-ratio stretch can introduce phasiness and blurred transients. | Avoid for extreme percussive-only material when attack realism is critical. |
| time_scale_and_pitch_core.wsola_waveform_similarity_overlap_add | Frames are locally quasi-stationary and harmonic evolution is reasonably smooth. | High-ratio stretch can introduce phasiness and blurred transients. | Avoid for extreme percussive-only material when attack realism is critical. |
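
The exclusion cases for this group can be turned into a rough preflight check. The sketch below is an illustration under stated assumptions, not pvx behavior: it uses crest factor (peak-to-RMS ratio) as a crude proxy for transient-heavy material and treats stretch ratios far from 1.0 as risky. The function names and the thresholds `max_ratio=2.0` and `crest_limit_db=18.0` are hypothetical placeholders that would need tuning by ear.

```python
import math


def crest_factor_db(samples):
    """Peak-to-RMS ratio in dB; higher values suggest transient-heavy material."""
    peak = max(abs(s) for s in samples)
    rms = math.sqrt(sum(s * s for s in samples) / len(samples))
    if rms == 0.0:
        return 0.0  # silent input: report a neutral value
    return 20.0 * math.log10(peak / rms)


def stretch_warnings(samples, stretch_ratio, max_ratio=2.0, crest_limit_db=18.0):
    """Return advisory strings for a planned time-stretch.

    Thresholds are illustrative placeholders, not pvx defaults.
    """
    if stretch_ratio <= 0.0:
        raise ValueError("stretch_ratio must be positive")
    warnings = []
    # Treat 0.5x and 2x as equally far from unity.
    severity = max(stretch_ratio, 1.0 / stretch_ratio)
    if severity > max_ratio:
        warnings.append("stretch ratio is far from 1.0: expect phasiness and transient blur")
    if crest_factor_db(samples) > crest_limit_db:
        warnings.append("material looks transient-heavy: audition attacks before committing")
    return warnings
```

Crest factor is a blunt measure: a single loud click in an otherwise sustained recording raises it sharply. An empty warning list therefore means only that neither heuristic fired, not that the result will be artifact-free.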

Attribution

See ATTRIBUTION.md.