Google Scholar

User profiles for Koichi Shinoda

Koichi SHINODA

Tokyo Institute of Technology

Verified email at cs.titech.ac.jp

Cited by 3977

Attentive statistics pooling for deep speaker embedding

K Okabe, T Koshinaka, K Shinoda - arXiv preprint arXiv:1803.10963, 2018 - arxiv.org

This paper proposes attentive statistics pooling for deep speaker embedding in text-independent
speaker verification. In conventional speaker embedding, frame-level features are …

Cited by 557

MDL-based context-dependent subword modeling for speech recognition

K Shinoda, T Watanabe - Acoustical Science and Technology, 2000 - jstage.jst.go.jp

Context-dependent phone units, such as triphones, have recently come to be used to model
subword units in speech recognition systems that are based on the use of hidden Markov …

Cited by 370

A structural Bayes approach to speaker adaptation

K Shinoda, CH Lee - IEEE Transactions on Speech and Audio …, 2001 - ieeexplore.ieee.org

Maximum a posteriori (MAP) estimation has been successfully applied to speaker adaptation
in speech recognition systems using hidden Markov models. When the amount of data is …

Cited by 219

Acoustic modeling based on the MDL principle for speech recognition

K Shinoda, T Watanabe - Eurospeech, 1997 - isca-archive.org

Recently context-dependent phone units, such as triphones, have been used to model subword
units in speech recognition based on Hidden Markov Models (HMMs). While most such …

Cited by 202

Multimodal emotion recognition with high-level speech and text features

MR Makiuchi, K Uto, K Shinoda - 2021 IEEE Automatic Speech …, 2021 - ieeexplore.ieee.org

Automatic emotion recognition is one of the central concerns of the Human-Computer Interaction
field as it can bridge the gap between humans and machines. Current works train deep …

Cited by 66

Multimodal fusion of BERT-CNN and gated CNN representations for depression detection

…, T Warnita, K Uto, K Shinoda - Proceedings of the 9th …, 2019 - dl.acm.org

Depression is a common, but serious mental disorder that affects people all over the world.
Besides providing an easier way of diagnosing the disorder, a computer-aided automatic …

Cited by 108

Implicit neural representations for variable length human motion generation

P Cervantes, Y Sekikawa, I Sato, K Shinoda - European Conference on …, 2022 - Springer

We propose an action-conditional human motion generation method using variational
implicit neural representations (INR). The variational formalism enables action-conditional …

Cited by 37

Detecting Alzheimer's disease using gated convolutional neural network from audio data

T Warnita, N Inoue, K Shinoda - arXiv preprint arXiv:1803.11344, 2018 - arxiv.org

We propose an automatic detection method of Alzheimer's diseases using a gated convolutional
neural network (GCNN) from speech data. This GCNN can be trained with a relatively …

Cited by 52

User adaptation of convolutional neural network for human activity recognition

…, Y Akagi, G Nagino, K Shinoda - 2017 25th European …, 2017 - ieeexplore.ieee.org

Recently, monitoring human activities using smartphone sensors, such as accelerometers,
magnetometers, and gyroscopes, has been proved effective to improve productivity in daily …

Cited by 52

Flower colors and their anthocyanins in Matthiola incana cultivars (Brassicaceae)

F Tatsuzawa, N Saito, K Toki, K Shinoda… - Journal of the …, 2012 - jstage.jst.go.jp

The flower colors and anthocyanin constitution of eight cultivars of Vintage series bedding
Stock (Matthiola incana) were surveyed to determine the relation between their flower colors …

Cited by 62
