diff --git a/UseCases/Chat_with_docs_GenAI/Chat_with_docs_OpenSource_GenAI_Python.ipynb b/UseCases/Chat_with_docs_GenAI/Chat_with_docs_OpenSource_GenAI_Python.ipynb index 8c4f9007..4b468379 100644 --- a/UseCases/Chat_with_docs_GenAI/Chat_with_docs_OpenSource_GenAI_Python.ipynb +++ b/UseCases/Chat_with_docs_GenAI/Chat_with_docs_OpenSource_GenAI_Python.ipynb @@ -19,65 +19,65 @@ "id": "fff71661-19b4-423a-867a-7c815b064c81", "metadata": {}, "source": [ - "

Introduction:

\n", + "

Introduction:

\n", "\n", - "

In the Chat with documentation system using Generative AI demo, the combination of RAG, Langchain, and LLM models allows users to ask queries in layman's terms, retrieve relevant information from the Vector store, and generate accurate and concise answers based on the retrieved data. This integration of retrieval-based and generative-based approaches provides a powerful tool for extracting knowledge from structured sources and delivering user-friendly responses.

\n", + "

In the Chat with documentation using Generative AI demo, the combination of RAG, LangChain, and LLMs allows users to ask queries in layman's terms, retrieve relevant information from the vector store, and generate accurate and concise answers based on the retrieved data. This integration of retrieval-based and generative-based approaches provides a powerful tool for extracting knowledge from structured sources and delivering user-friendly responses.

\n", "\n", - "

In this demo we will build Chatbot using Panel (for chat UI), LangChain, a powerful library for working with LLMs like GPT-3.5, GPT-4, Bloom, etc. and JumpStart in ClearScape notebooks, a system is built where users can ask business questions in natural English and receive answers with data drawn from the relevant databases.

\n", + "

In this demo we will build a chatbot using Panel (for the chat UI), LangChain (a powerful library for working with LLMs such as GPT-3.5, GPT-4, and Bloom), and JumpStart in ClearScape notebooks. The result is a system in which users can ask business questions in natural English and receive answers with data drawn from the relevant databases.

\n", "\n", - "

The following diagram illustrates the architecture.

\n", + "

The following diagram illustrates the architecture.

\n", "\n", "
\"architecture\"
\n", "\n", "\n", "
\n", - "

Before going any farther, let's get a better understanding of RAG, LangChain, and LLM.

\n", + "

Before going any further, let's get a better understanding of RAG, LangChain, and LLMs.

\n", "\n", - "
  1. Retrieval-Augmented Generation (RAG):
\n", - "

   RAG is a framework that combines the strengths of retrieval-based and generative-based approaches in question-answering systems.It utilizes both a retrieval model and a generative model to generate high-quality answers to user queries. The retrieval model is responsible for retrieving relevant information from a knowledge source, such as a database or documents. The generative model then takes the retrieved information as input and generates concise and accurate answers in natural language.

\n", + "
  1. Retrieval-Augmented Generation (RAG):
\n", + "

   RAG is a framework that combines the strengths of retrieval-based and generative-based approaches in question-answering systems. It utilizes both a retrieval model and a generative model to generate high-quality answers to user queries. The retrieval model is responsible for retrieving relevant information from a knowledge source, such as a database or documents. The generative model then takes the retrieved information as input and generates concise and accurate answers in natural language.

\n", "\n", "\n", - "

A typical RAG (Retrieval-and-Generation) application has two main components:

\n", + "

A typical RAG (Retrieval-Augmented Generation) application has two main components:

\n", "\n", - "

Indexing: a pipeline for ingesting data from a source and indexing it. This usually happens offline. The indexing process involves several steps, including loading the data, splitting it into smaller chunks, and storing and indexing the splits. This is often done using a VectorStore and Embeddings model.

\n", + "

Indexing: a pipeline for ingesting data from a source and indexing it. This usually happens offline. The indexing process involves several steps, including loading the data, splitting it into smaller chunks, and storing and indexing the splits. This is often done using a VectorStore and Embeddings model.

\n", " \n", - "

Retrieval and generation: the actual RAG chain, which takes the user query at run time and retrieves the relevant data from the index, then passes that to the model. The retrieval process involves searching the index for the most relevant data based on the user query, and then passing that data to the model for generation.

\n", + "

Retrieval and generation: the actual RAG chain, which takes the user query at run time and retrieves the relevant data from the index, then passes that to the model. The retrieval process involves searching the index for the most relevant data based on the user query, and then passing that data to the model for generation.

\n", "\n", - "

The most common full sequence from raw data to answer looks like:

\n", - "

Indexing

\n", - "