Yamagishi Laboratory, National Institute of Informatics, Japan

22 Seguidores

19 SlideShares 22 Seguidores 2 Seguindos

speech synthesis deep learning speech information processing wavenet tacotron mean opinion score attention speaker verification ai 音声研究会チュートリアル音声合成 machine learning tts acoustic environment device recording speech enhancement speech dataset speech quality assessment synthetic speech evaluation voicemos challenge multilingual hifi-gan correlation alignment self-supervised learning speaker anonymization multiple enrollment spoofing aware mos prediction speech naturalness assessment logical access countermeasure presentation attack detection anti-spoofing resnet tdnn listening test evaluation midi music synthesis vector quantization voice conversion text-to-speech waveform generation neural waveform models テキスト音声合成ディープラーニング

Ver mais

Atividades
Sobre

Yamagishi Laboratory, National Institute of Informatics, Japan

Apresentações

エンドツーエンド音声合成に向けたNIIにおけるソフトウェア群～ TacotronとWaveNetのチュートリアル (Part 1)～

エンドツーエンド音声合成に向けたNIIにおけるソフトウェア群～ TacotronとWaveNetのチュートリアル (Part 2)～

Tutorial on end-to-end text-to-speech synthesis: Part 1 – Neural waveform modeling

Tutorial on end-to-end text-to-speech synthesis: Part 2 – Tactron and related end-to-end systems

Neural source-filter waveform model

Neural Waveform Modeling

Advancements in Neural Vocoders

Preliminary study on using vector quantization latent spaces for TTS/VC systems with consistent performance

Text-to-Speech Synthesis Techniques for MIDI-to-Audio Synthesis

How do Voices from Past Speech Synthesis Challenges Compare Today?

Attention Back-end for Automatic Speaker Verification with Multiple Enrollment Utterances

Estimating the confidence of speech spoofing countermeasure

Generalization Ability of MOS Prediction Networks

Odyssey 2022: Investigating self-supervised front ends for speech spoofing countermeasures

Odyssey 2022: Language-Independent Speaker Anonymization Approach using Self-Supervised Pre-Trained Models

Spoofing-aware Attention Back-end with Multiple Enrollment and Novel Trials Sampling Strategy for SASVC 2022

Analyzing Language-Independent Speaker Anonymization Framework under Unseen Conditions

The VoiceMOS Challenge 2022

DDS: A new device-degraded speech dataset for speech enhancement