VS Code Dev Container
Sources:
- Developing inside a Container from VS Code.
- Custom Dev Container Features from VS Code.
- See my github repo for my DevContainer confguration.
Sources:
Sources:
Stochastic Gradient Descent (SGD) algorithm: minimize \(J(w)=\mathbb{E}[f(w, X)]\) using \(\left\{\nabla_w f\left(w_k, x_k\right)\right\}\) \[ w_{k+1}=w_k-\alpha_k \nabla_w f\left(w_k, x_k\right) \]
Sources:
Robbins-Monro (RM) algorithm: solve \(g(w)=0\) using \(\left\{\tilde{g}\left(w_k, \eta_k\right)\right\}\) \[ w_{k+1}=w_k-a_k \tilde{g}\left(w_k, \eta_k\right) \]
Sources:
MoCo and BYOL are both famous contrastive learning and self-supervised learning frameworks. They introduce an interesting designs:
This design is heavily used in further research like DINO as is worth learning about.
You may need to read my post for SimCLR to get a deeper understanding of contrastive learning.
Deep Q-learning or deep Q-network (DQN) is one of the earliest and most successful algorithms that introduce deep neural networks into RL.
DQN is not Q learning, at least not the specific Q learning algorithm introduced in my post, but it shares the core idea of Q learning.
Sources:
Sources:
Sources: