梅尔频谱倒谱系数 Mel Frequency Cepstral Coefficients (MFCC)
滤波器组 Filterbank Energies
指数滤波器组 Log Filterbank Energies
声音动态检测 Voice Activity Detection (VAD)
使用方法
import spectra_torch.base as mm
import torchaudio as ta
sig, sr = ta.load_wav('singing-01-003.wav')
sig = sig[0]
mfcc = mm.mfcc(sig, sr)# MFCC
starts, detection = mm.is_speech(sig, sr, speechlen=1)# VAD