Published on September 16, 2023
Retentive Networks and RWKV
Tags: ml, nlp
A short, hand-wavy explainer for the mathematical intuition behind faster attention mechanisms.

Published on June 29, 2023
Fast Attention Implementations
Tags: ml, nlp, references
A reference collection of fast attention implementations.

Published on June 16, 2023
RWKV Language Model Math
Tags: ml, nlp, math
In-depth explanation of the math behind the RWKV model, with PyTorch implementations, plus a discussion of numerical stability.

Published on August 2, 2016
Using Gensim Word2Vec Embeddings in Keras
Tags: ml, nlp
A short post and script on using Gensim Word2Vec embeddings in Keras, with example code.

Published on April 27, 2016
Question Answering using Keras
Tags: ml, nlp
An in-depth introduction to using Keras for language modeling: word embeddings, recurrent and convolutional neural networks, attentional RNNs, and similarity metrics for vector embeddings.