Apply deep learning to audio and speech processing applications by using Deep Learning Toolbox™ together with Audio Toolbox™.
| App or function | Description |
| --- | --- |
| Audio Labeler | Define and visualize ground-truth labels |
| audioDatastore | Datastore for collection of audio files |
| audioDataAugmenter | Augment audio data |
| audioFeatureExtractor | Streamline audio feature extraction |
| vggishFeatures | Extract VGGish features |
| vggish | VGGish neural network |
| yamnet | YAMNet neural network |
| yamnetGraph | Graph of YAMNet AudioSet ontology |
| classifySound | Classify sounds in audio signal |
Introduction to Deep Learning for Audio Applications (Audio Toolbox)
Learn the common tools and workflows for applying deep learning to audio applications.
Classify Sound Using Deep Learning (Audio Toolbox)
Train, validate, and test a simple long short-term memory (LSTM) network to classify sounds.
Transfer Learning with Pretrained Audio Networks (Audio Toolbox)
Use transfer learning to retrain YAMNet, a pretrained convolutional neural network (CNN), to classify a new set of audio signals.