11.6 ms (512 samples at 44.1 kHz) – suitable for live performance. 4. Perceptual Evaluation A pilot listening test was conducted with 30 participants (20 audio professionals, 10 naive listeners).
5 spoken phrases ("The moon rises over the silent field") processed through SVVG v1.0 at three EC settings (0.3, 0.6, 0.9). Spirit Voice Vocal Generator v1.0
| Parameter | Range | Default | Description | |-----------|-------|---------|-------------| | Spectral Dispersion | 0 – 1.0 | 0.65 | Degree of formant stretching/compression | | Subharmonic Mix | -inf – +6 dB | -3 dB | Level of f0/2 and f0/3 components | | Turbulence Density | 0 – 1.0 | 0.4 | Amplitude of stochastic noise layer | | Temporal Smear | 0 – 50 ms | 15 ms | Phase randomization across frequency bins | | Dry/Wet Mix | 0 – 1.0 | 0.7 | Balance of original vs. processed signal | 5 spoken phrases ("The moon rises over the
11.6 ms (512 samples at 44.1 kHz) – suitable for live performance. 4. Perceptual Evaluation A pilot listening test was conducted with 30 participants (20 audio professionals, 10 naive listeners).
5 spoken phrases ("The moon rises over the silent field") processed through SVVG v1.0 at three EC settings (0.3, 0.6, 0.9).
| Parameter | Range | Default | Description | |-----------|-------|---------|-------------| | Spectral Dispersion | 0 – 1.0 | 0.65 | Degree of formant stretching/compression | | Subharmonic Mix | -inf – +6 dB | -3 dB | Level of f0/2 and f0/3 components | | Turbulence Density | 0 – 1.0 | 0.4 | Amplitude of stochastic noise layer | | Temporal Smear | 0 – 50 ms | 15 ms | Phase randomization across frequency bins | | Dry/Wet Mix | 0 – 1.0 | 0.7 | Balance of original vs. processed signal |