deep-learning

Monotonic AttentionNovember 9, 2023
Write-up explaining my implementation of monotonic attention using a probabilistic graphical model.
machine-learning speech deep-learning algorithms
Retentive Networks and RWKVSeptember 16, 2023
A short, hand-wavy explainer for the mathematical intuition behind faster attention mechanisms.
machine-learning nlp deep-learning algorithms
RWKV Language Model MathJune 16, 2023
In-depth explanation of the math behind the RWKV model, with PyTorch implementations, plus a discussion of numerical stability.
machine-learning nlp deep-learning math optimization
Robotics Pre-training IdeaNovember 1, 2022
A collection of my ideas relating to robotics pre-training.
robotics machine-learning deep-learning generative-models
Using Gensim Word2Vec Embeddings in KerasAugust 2, 2016
A short post and script regarding using Gensim Word2Vec embeddings in Keras, with example code.
machine-learning deep-learning nlp ml-frameworks
Restricted Boltzmann MachinesJuly 18, 2016
Building on the Recurrent RBM for sequence modeling. This post relates to what I am doing for my Master's thesis.
machine-learning deep-learning generative-models probabilistic-models
Question Answering using KerasApril 27, 2016
An in-depth introduction to using Keras for language modeling; word embedding, recurrent and convolutional neural networks, attentional RNNs, and similarity metrics for vector embeddings.
machine-learning deep-learning nlp ml-frameworks
A Neural Network in 28 Lines of TheanoFebruary 23, 2016
A quick introduction to using Theano for deep learning, from the bare-bones to a full neural network.
machine-learning deep-learning ml-frameworks software-engineering