STOI 短时客观可懂度MATLAB代码

大小: 8KB

文件类型: .m

金币: 2

下载: 1 次

发布日期: 2021-05-09
语言: Matlab
标签: STOI MATLAB

高速下载

资源简介

代码亲测可用，直接调用stoi函数，格式是stoi(x,y,fs)，分别是干净语音，带噪语音，采样频率

资源截图

小图大图

代码片段和文件信息

function d = stoi（x y fs_signal）
%   d = stoi（x y fs_signal） returns the output of the short-time
%   objective intelligibility （STOI） measure described in [1 2] where x 
%   and y denote the clean and processed speech respectively with sample
%   rate fs_signal in Hz. The output d is expected to have a monotonic 
%   relation with the subjective speech-intelligibility where a higher d 
%   denotes better intelligible speech. See [1 2] for more details.
%
%   References:
%      [1] C.H.Taal R.C.Hendriks R.Heusdens J.Jensen ‘A Short-Time
%      objective Intelligibility Measure for Time-Frequency Weighted Noisy
%      Speech‘ ICASSP 2010 Texas Dallas.
%
%      [2] C.H.Taal R.C.Hendriks R.Heusdens J.Jensen ‘An Algorithm for 
%      Intelligibility Prediction of Time-Frequency Weighted Noisy Speech‘ 
%      IEEE Transactions on Audio Speech and Language Processing 2011. 
%
%
% Copyright 2009: Delft University of Technology Signal & Information
% Processing Lab. The software is free for non-commercial use. This program
% comes WITHOUT ANY WARRANTY.
%
%
%
% Updates:
% 2011-04-26 Using the more efficient ‘taa_corr‘ instead of ‘corr‘

if length（x）~=length（y）
    error（‘x and y should have the same length‘）;
end

% initialization
x           = x（:）;                             % clean speech column vector
y           = y（:）;                             % processed speech column vector

fs          = 10000;                            % sample rate of proposed intelligibility measure
N_frame    	= 256;                              % window support
K           = 512;                              % FFT size
J           = 15;                               % Number of 1/3 octave bands
mn          = 150;                              % Center frequency of first 1/3 octave band in Hz.
H           = thirdoct（fs K J mn）;           % Get 1/3 octave band matrix
N           = 30;                               % Number of frames for intermediate intelligibility measure （Length analysis window）
Beta        = -15;                           	% lower SDR-bound
dyn_range   = 40;                               % speech dynamic range

% resample signals if other samplerate is used than fs
if fs_signal ~= fs
    x	= resample（x fs fs_signal）;
    y 	= resample（y fs fs_signal）;
end

% remove silent frames
[x y] = removeSilentframes（x y dyn_range N_frame N_frame/2）;

% apply 1/3 octave band TF-decomposition
x_hat     	= stdft（x N_frame N_frame/2 K）; 	% apply short-time DFT to clean speech
y_hat     	= stdft（y N_frame N_frame/2 K）; 	% apply short-time DFT to processed speech

x_hat       = x_hat（: 1:（K/2+1））.‘;         	% take clean single-sided spectrum
y_hat       = y_hat（: 1:（K/2+1））.‘;        	% take processed single-sided spectrum

X           = zeros（J size（x_hat 2））;         % init memory for clean speech 1/3 octave band TF-representation 
Y           = zeros（J size（y_

上一篇：matlab的pwm函数编写
下一篇：水平集图像分割的Matlab程序代码

共有条评论

STOI 短时客观可懂度MATLAB代码

资源简介

资源截图

代码片段和文件信息

评论

相关资源