Deep Reinforcement Learning Parameters

Epsilon (ϵepsilon 𝜖): This is a hyperparameter (a value between 0 and 1) that defines the probability of an agent taking a random action (exploration) instead of the action believed to have the highest reward (exploitation). Epsilon Decay: This is the process of gradually reducing the value of 𝜖 over time or a number of ... Read More

All Machine Learning MOdels

https://www.ibm.com/think/topics/machine-learning-types Machine learning models are grouped in one of 5 types, supervised, unsupervised, self-supervised, semi-supervised and reinforcement learning. Machine learning libraries, useful.

MNIST PCA

https://medium.com/@azimkhan8018/a-beginners-guide-to-deep-learning-with-mnist-dataset-0894f7183344https://ranasinghiitkgp.medium.com/principal-component-analysis-pca-with-code-on-mnist-dataset-da7de0d07c22https://github.com/mkosaka1/MNIST_PCA_CNN/blob/main/PCA%20%2B%20CNN.ipynbhttps://gist.github.com/tommct/1490cdf856d745ba41c1ac99ada2b579

Reinforcement Learning

https://au.mathworks.com/content/dam/mathworks/ebook/gated/reinforcement-learning-ebook-all-chapters.pdf Reinforcement Learning: Type of machine learning, where a computer agent learns to perform a task through a process of repeated trial and error within a dynamic environment. The goal is to learn what to do, how to map situations to take certain actions that minimize a numerical reward signal. Eg. RL trained computer gains ... Read More

Optimization

Rosenbrock Function $latex f(x,y) = (1-x)^2 + 100(y-x^2)^2$ https://docs.scipy.org/doc/scipy/tutorial/optimize.html

What is Gradient? Loss Function? Model Parameters? Gradient Descent?

Gradient Descent: Optimization algorithm; minimizes errors between predicted and actual results; updates parameters by moving against the gradient Loss Function: aim is to minimize this function, closer to zero; measures how bad the prediction is in comparison to the actual true value; various methods are used; one is Mean Squared Error (MSE) Gradient: Slope; direction ... Read More

Conda Environment

conda create -n env python=version conda activate env conda deactivate Useful tools: Starting jupyter notebook (env) file_path > jupyter notebook

Tensor basics

https://tensorly.org/stable/user_guide/tensor_basics.html Tensor – multi-dimensional array https://tensorly.org/stable/user_guide/tensor_decomposition.html
error: