yag_aysのブックマーク - はてなブックマーク

Non-Zero Initial States for Recurrent Neural Networks - R2RT
The default approach to initializing the state of an RNN is to use a zero state. This often works well, particularly for sequence-to-sequence tasks like language modeling where the proportion of outputs that are significantly impacted by the initial state is small. In some cases, however, it makes sense to (1) train the initial state as a model parameter, (2) use a noisy initial state, or (3) both
yag_ays 2017/08/15
リンク
Written Memories: Understanding, Deriving and Extending the LSTM - R2RT
When I was first introduced to Long Short-Term Memory networks (LSTMs), it was hard to look past their complexity. I didn’t understand why they were designed the way they were designed, just that they worked. It turns out that LSTMs can be understood, and that, despite their superficial complexity, LSTMs are actually based on a couple incredibly simple, even beautiful, insights into neural network
yag_ays 2016/08/07
リンク
1

はてなブックマーク