User profiles for Koichi Shinoda
Koichi SHINODATokyo Institute of Technology Verified email at cs.titech.ac.jp Cited by 3977 |
Attentive statistics pooling for deep speaker embedding
This paper proposes attentive statistics pooling for deep speaker embedding in text-independent
speaker verification. In conventional speaker embedding, frame-level features are …
speaker verification. In conventional speaker embedding, frame-level features are …
MDL-based context-dependent subword modeling for speech recognition
K Shinoda, T Watanabe - Acoustical Science and Technology, 2000 - jstage.jst.go.jp
Context-dependent phone units, such as triphones, have recently come to be used to model
subword units in speech recognition systems that are based on the use of hidden Markov …
subword units in speech recognition systems that are based on the use of hidden Markov …
A structural Bayes approach to speaker adaptation
Maximum a posteriori (MAP) estimation has been successfully applied to speaker adaptation
in speech recognition systems using hidden Markov models. When the amount of data is …
in speech recognition systems using hidden Markov models. When the amount of data is …
[PDF][PDF] Acoustic modeling based on the MDL principle for speech recognition.
K Shinoda, T Watanabe - Eurospeech, 1997 - isca-archive.org
Recently context-dependent phone units, such as triphones, have been used to model subword
units in speech recognition based on Hidden Markov Models (HMMs). While most such …
units in speech recognition based on Hidden Markov Models (HMMs). While most such …
Multimodal emotion recognition with high-level speech and text features
Automatic emotion recognition is one of the central concerns of the Human-Computer Interaction
field as it can bridge the gap between humans and machines. Current works train deep …
field as it can bridge the gap between humans and machines. Current works train deep …
Multimodal fusion of bert-cnn and gated cnn representations for depression detection
Depression is a common, but serious mental disorder that affects people all over the world.
Besides providing an easier way of diagnosing the disorder, a computer-aided automatic …
Besides providing an easier way of diagnosing the disorder, a computer-aided automatic …
Implicit neural representations for variable length human motion generation
We propose an action-conditional human motion generation method using variational
implicit neural representations (INR). The variational formalism enables action-conditional …
implicit neural representations (INR). The variational formalism enables action-conditional …
Detecting Alzheimer's disease using gated convolutional neural network from audio data
We propose an automatic detection method of Alzheimer's diseases using a gated convolutional
neural network (GCNN) from speech data. This GCNN can be trained with a relatively …
neural network (GCNN) from speech data. This GCNN can be trained with a relatively …
User adaptation of convolutional neural network for human activity recognition
…, Y Akagi, G Nagino, K Shinoda - 2017 25th European …, 2017 - ieeexplore.ieee.org
Recently, monitoring human activities using smart-phone sensors, such as accelerometers,
magnetometers, and gyro-scopes, has been proved effective to improve productivity in daily …
magnetometers, and gyro-scopes, has been proved effective to improve productivity in daily …
Flower colors and their anthocyanins in Matthiola incana cultivars (Brassicaceae)
F Tatsuzawa, N Saito, K Toki, K Shinoda… - Journal of the …, 2012 - jstage.jst.go.jp
The flower colors and anthocyanin constitution of eight cultivars of Vintage series bedding
Stock (Matthiola incana) were surveyed to determine the relation between their flower colors …
Stock (Matthiola incana) were surveyed to determine the relation between their flower colors …