I made available VGG16, VGG19, and ResNet50: Keras code + ImageNet weights for both TF and Theano. https://github.com/fchollet/deep-learning-models …
Audio > spectrogram (a 2D matrix: y axis is frequency, x axis is time). 2D matrix > convnet, then slide it along the time axis. pic.twitter.com/QqmCk0PwQu
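A rough sketch of the audio > spectrogram > sliding-window step, in plain Python so it runs without dependencies (in practice you would likely use NumPy or SciPy instead). The function names `spectrogram` and `sliding_windows`, the frame length, hop, window width, and the test tone are all illustrative assumptions, not something taken from the thread.

```python
import cmath
import math


def spectrogram(signal, frame_len=64, hop=32):
    """Magnitude spectrogram: rows are frequency bins (y), columns are time frames (x)."""
    n_bins = frame_len // 2 + 1
    # Hann window to reduce spectral leakage at the frame edges.
    window = [0.5 - 0.5 * math.cos(2 * math.pi * n / frame_len) for n in range(frame_len)]
    frames = [
        [signal[start + n] * window[n] for n in range(frame_len)]
        for start in range(0, len(signal) - frame_len + 1, hop)
    ]
    spec = [[0.0] * len(frames) for _ in range(n_bins)]
    for t, frame in enumerate(frames):
        for k in range(n_bins):
            # Naive DFT for bin k (fine for a toy example, slow for real audio).
            acc = sum(x * cmath.exp(-2j * math.pi * k * n / frame_len)
                      for n, x in enumerate(frame))
            spec[k][t] = abs(acc)
    return spec


def sliding_windows(spec, width, step):
    """Cut the spectrogram into fixed-width patches along the time (x) axis."""
    n_frames = len(spec[0])
    return [
        [row[start:start + width] for row in spec]
        for start in range(0, n_frames - width + 1, step)
    ]


# Assumed toy input: one second of a 64 Hz sine sampled at 512 Hz.
sample_rate = 512
tone = [math.sin(2 * math.pi * 64 * n / sample_rate) for n in range(sample_rate)]

spec = spectrogram(tone)                          # frequency x time matrix
patches = sliding_windows(spec, width=4, step=2)  # one patch per convnet input
peak_bin = max(range(len(spec)), key=lambda k: spec[k][0])
```

Each patch keeps the full frequency axis and a short slice of time, so the same convnet can be applied to every patch in turn.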
The raw output of the convnet needs to be passed through the LSTM one time step at a time. Ideal for speech. https://arxiv.org/abs/1512.02595
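To make "through the LSTM by time" concrete, here is a minimal sketch in plain Python. It assumes the convnet output is laid out as [channel][frequency][time], flattens it into one feature vector per time step, and feeds those vectors in order through a hand-written LSTM cell with random, untrained weights. The layout, the helper names (`to_time_sequence`, `lstm_step`, `run_lstm`), and the sizes are assumptions for illustration; a real model would use a framework layer such as the Keras LSTM.

```python
import math
import random


def to_time_sequence(feature_map):
    """Turn [channel][freq][time] conv output into one flat vector per time step."""
    n_time = len(feature_map[0][0])
    return [
        [channel[f][t] for channel in feature_map for f in range(len(channel))]
        for t in range(n_time)
    ]


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def lstm_step(x, h, c, weights):
    """One LSTM step; weights maps each gate name to (W, b) acting on [x, h]."""
    z = x + h  # list concatenation of the input and the previous hidden state

    def affine(name):
        W, b = weights[name]
        return [sum(w * v for w, v in zip(row, z)) + bias for row, bias in zip(W, b)]

    i = [sigmoid(v) for v in affine("input")]
    f = [sigmoid(v) for v in affine("forget")]
    o = [sigmoid(v) for v in affine("output")]
    g = [math.tanh(v) for v in affine("candidate")]
    c_new = [fv * cv + iv * gv for fv, cv, iv, gv in zip(f, c, i, g)]
    h_new = [ov * math.tanh(cv) for ov, cv in zip(o, c_new)]
    return h_new, c_new


def run_lstm(sequence, hidden_size, seed=0):
    """Run the cell over the sequence in time order; weights are random (untrained)."""
    rng = random.Random(seed)
    in_size = len(sequence[0]) + hidden_size
    weights = {
        name: ([[rng.uniform(-0.1, 0.1) for _ in range(in_size)]
                for _ in range(hidden_size)],
               [0.0] * hidden_size)
        for name in ("input", "forget", "output", "candidate")
    }
    h = [0.0] * hidden_size
    c = [0.0] * hidden_size
    outputs = []
    for x in sequence:  # the time axis is the loop axis
        h, c = lstm_step(x, h, c, weights)
        outputs.append(h)
    return outputs


# Assumed toy conv output: 2 channels, 3 frequency rows, 5 time steps.
feature_map = [[[ch * 100 + f * 10 + t for t in range(5)]
                for f in range(3)] for ch in range(2)]
sequence = to_time_sequence(feature_map)
outputs = run_lstm(sequence, hidden_size=4)
```

The key point is the reshape: channels and frequency are merged into the feature dimension, while time stays as the sequence dimension the LSTM iterates over.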