We present novel low-level audio features that are based on correlations between sub-band audio signals decomposed by undecimated wavelet transform. Under the assumption that SVM is used for classifier learning, the experimental results on GTZAN dataset showed that the proposed method demonstrated the best accuracy of 84.8%, outperforming the conventional methods.

footer 著作権について 倫理綱領 プライバシーポリシー セキュリティ 情報処理学会