RAG update

2024-03-28 18:11:27 +01:00 · 2024-03-28 18:11:27 +01:00 · bb79085731
parent 2393b6a5ea
commit bb79085731
8 changed files with 27 additions and 0 deletions
--- a/LLM's/W3-ReinforcementLearning=LLM-poweredApplications.md
+++ b/LLM's/W3-ReinforcementLearning=LLM-poweredApplications.md
@ -44,4 +44,31 @@ The clarity of your instructions can make a big difference on the quality of the

 [Constitutional AI: Harmlessness from AI Feedback paper](images/ConstitutionalAI.pdf)

+## LLM Powered applications
+
+### Introduction Model optimizations for deployment
+
+Increase performance -> reduce LLM size, which reduces inference latency
+The challenge is to reduce the size of the model while still maintaining
+model performance.
+
+![Model Optimizations Techniques](images/optimizationsTechniques1.png)]
+
+[Video has a lot of information](images/ModelOptimizationsfordeployment.mp4)
+
+![Generative AI Project Lifecycle Cheat Sheet](images/GenerativeAIProjectLifecycleCheatSheet.png)
+
+~[LLM-Powered Appplication](images/PowerApplications1.png)
+
+Langchain is an example of Orchestration Library
+
+Retrieval Augmented Generation (**RAG**) is a great way to overcome the knowledge cutoff (because the world has changes since the model was trained with data current to that date) issue and help the model update its understanding of the world.
+
+![RAG](images/RAG1.png)
+
+[Facebook RAG paper](images/RAG_Paper.pdf)
+
+The external data store could be a vector store,a SQL database, CSV files, Wikis or other data storage format.
+
+![RAG](images/RAG2.png)

--- a/LLM's/images/GenerativeAIProjectLifecycleCheatSheet.png
+++ b/LLM's/images/GenerativeAIProjectLifecycleCheatSheet.png
--- a/LLM's/images/ModelOptimizationsfordeployment.mp4
+++ b/LLM's/images/ModelOptimizationsfordeployment.mp4
--- a/LLM's/images/PowerApplications1.png
+++ b/LLM's/images/PowerApplications1.png
--- a/LLM's/images/RAG1.png
+++ b/LLM's/images/RAG1.png
--- a/LLM's/images/RAG2.png
+++ b/LLM's/images/RAG2.png
--- a/LLM's/images/RAG_Paper.pdf
+++ b/LLM's/images/RAG_Paper.pdf
--- a/LLM's/images/optimizationsTechniques1.png
+++ b/LLM's/images/optimizationsTechniques1.png