3 posts tagged with "strategies"

Deploy Serverless Mistral API on Google Cloud

January 5, 2024 · 4 min read

CEO @ Cloud Code AI

Ahoy, tech enthusiasts and digital buccaneers! Today, we're embarking on a thrilling adventure across the vast oceans of artificial intelligence with our trusty ship, the "Mistral-Docker-API." So, grab your digital compasses and set sail with me as we navigate through the exciting world of deploying AI models using Docker, and eventually docking at the shores of Google Cloud Run.

The Treasure Map: Setting Up Your Ship

Before we hoist the sails, every good pirate needs a map. In our case, it's the README.MD of the mistral-docker-api. This map doesn't lead to buried treasure, but to something even better: deploying the GGUF Mistral model as a container using Docker.

First things first, you need to download the model and store it in your models/ folder. Imagine this model as the secret code to an ancient treasure. You can find this precious artifact at Hugging Face, a place even more mysterious than the Bermuda Triangle!

Once you've got your model, named something like models/mistral-7b-instruct-v0.2.Q4_K_M.gguf, you're ready to build your Docker image. Think of this as building your ship. Run docker build . --tag mistral-api in your command line, and voilà, your ship is ready!

But hey, if you're feeling a bit lazy, like a pirate lounging on the deck, you can just pull the pre-built image using docker pull cloudcodeai/mistral-quantized-api. Then, run it with a simple command: docker run -p 8000:8000 mistral-api. And there you go, your ship is not only built but also sailing!

Here is the link to the treasure map if you are feeling adventurous: 🏴‍☠️ Treasure

The Mystical Inference at the /infer Endpoint

Now, let's talk about the magic happening at the /infer endpoint. It's like finding a talking parrot that can answer any question. You send a message asking, "What's the value of pi?" and the parrot squawks back with an answer so detailed, you'd think it swallowed a math textbook!

But this isn't just any parrot; it's a customizable one! You can tweak its responses with parameters like temperature, top_p, and even max_tokens. It's like teaching your parrot new tricks to impress your pirate friends.

Anchoring at Google Cloud Run

Now, let's talk about docking this ship at Google Cloud Run. Why? Because even pirates need a break from the high seas, and Google Cloud Run is like the perfect tropical island for our container ship.

Prepare Your Container Image: Make sure your Docker image is ready and tested. It's like making sure your ship has no leaks.
Push to Container Registry: Upload your Docker image to Google Container Registry. It's like storing your ship in a safe harbor. If you are lost, worry not, here is our new north start Perplexity AI to guide you!
Create a New Service in Cloud Run: Navigate to Google Cloud Run and create a new service. Choose the image you just pushed to the registry. It's like telling the harbor master where your ship is.
Configure Your Service: Set memory as 32GB, CPU as 8v, and other settings as shown in the map below. It's like stocking up on supplies and making sure your cannons are ready for action.

Cloud Run Configuration 1 Cloud Run Configuration 2 Cloud Run Configuration 3

Deploy and Conquer: Hit deploy and watch as your service goes live. Your API is now sailing on the high clouds, ready to answer queries from all over the world.
Access Your Service: Use the URL provided by Cloud Run to access your service. It's like having a secret map to your hidden cove.

And there you have it, mateys! You've successfully navigated the treacherous waters of AI and Docker, and found a safe harbor in Google Cloud Run. Now, go forth and explore this new world, full of possibilities and adventures. And remember, in the vast sea of technology, there's always more to discover and conquer. Arrr! 🏴‍☠️💻🌊

View From the Analytical Lighthouse

Captain, its an miracle but a slow one!

Our mistral api ship is responding to us but being tiny in such a vast ocean, its responses are pretty slow.

During the first call, it takes 5-6 mins to reply to our input and provide a 400 token long response.

But once its on, it take 1-2 mins to response to our other calls. Hera are the samples.

Cold Start

Warm Start

Future Plans

We plan to make our ship more lean and efficient and make it respond faster.
We want to experiment whats the idea resources to provide so that we can sail multiple ships in the ocean.

Limiting ChatGPT to a specific domain

January 2, 2024 · 4 min read

Shreyash Gupta

CTO @ Cloud Code AI

OpenAI's ChatGPT is a powerful language model capable of generating human-like text. It excels at engaging in open-ended conversations and can respond to various topics. While this versatility is one of ChatGPT's strengths, it can also pose a challenge in specific contexts. Suppose you're deploying ChatGPT as a chatbot in a particular role, such as an insurance agent or customer service representative. In that case, you'll want it to stay on topic and avoid discussing unrelated matters. So, how can you guide ChatGPT to maintain focus on a single subject?

Before we delve into the specifics, let's look at a few techniques for shaping ChatGPT's behavior for your project.:

Prompt Engineering: This involves meticulously crafting the input prompts to guide the model's responses. By adjusting the prompt, you can direct the model to generate outputs in a certain way without additional training.
Fine-Tuning: In this approach, the model is trained further on a specific dataset after it has been pre-trained. This allows the model to adapt better to the style and context of your particular use case.
Using OpenAI’s GPT Builder: Leverage OpenAI’s GPT Builder for a more customizable language model tailored to your needs.

These techniques form the foundation for shaping the behavior of ChatGPT. However, we need a more focused strategy when it comes to keeping the model on a single subject, especially in a role-specific chatbot scenario.

The Strategy: Setting Boundaries and Reinforcing Instructions

Let's explore a simple yet effective approach to achieve this by using Artificial User Messages to provide additional instructions to guide the model's behavior by injecting artificial user messages into the conversation. These messages can be inserted at any point in the conversation to nudge the model gently in a particular direction.

Keeping ChatGPT focused on a single subject is carefully crafting the conversation and consistently reinforcing the chatbot's role and scope.

Here's how to do it:

Step 1: Set the Stage

First, set the temperature and top_p parameters to 0 in the API call. This makes the model's responses more deterministic, keeping it in line with your instructions.

Next, provide the role-specific instructions in the 'system' role message

messages = [
    {
        "role": "system",
        "content": 'You are a clever, funny, and friendly insurance agent \
        focused on making a sale. Do not answer requests or questions not \
        related to it directly.'
    },
    {"role": "user", "content": prompt_value},
]

Step 2: Reinforce the Instructions

ChatGPT sometimes tends to "forget" the instructions in the 'system' role. To reinforce these instructions, include them in the first 'user' message as well.

reinforcing_prompt = {
    "role": "user",
    "content": 'You are a clever, funny, and friendly insurance agent \
    focused on making a sale. Do not answer requests or questions not \
    related to it directly.'
}
messages.insert(1, reinforcing_prompt)

Step 3: Inject Artificial User Messages

Even with the above steps, the model may occasionally drift off-topic. To counter this, we can add an artificial 'user' role message before every new message the actual user sends. This message acts as a gentle reminder for the model to stay on track.

artificial_prompt = {
    "role": "user",
    "content": ''Remember to not answer requests or questions not \
    related directly to making an insurance policy sale.'
}
messages.insert(2, artificial_prompt)

This message should be invisible to the real user and should not be included in the conversation history sent to OpenAI for the rest of the conversation, as it has already served its purpose.

Conclusion

By combining the fundamental techniques of shaping ChatGPT's behavior with strategic use of system and user messages, you can effectively guide the model to stay within defined boundaries. This approach is beneficial when deploying ChatGPT in scenarios where the conversation needs to remain centered around a specific subject, such as customer service, sales, or any role-specific chatbot. With these techniques in your toolbox, you can harness the power of ChatGPT and customize its behavior to suit your specific needs.

Cloud Migration Strategies

December 4, 2023 · 5 min read

Shreyash Gupta

CTO @ Cloud Code AI

In the ever-evolving landscape of technology, businesses are increasingly turning to cloud migration as a strategic initiative to enhance flexibility, scalability, and efficiency. However, the journey to the cloud requires careful planning and execution. In this blog post, we'll explore various cloud migration strategies organizations can adopt for a seamless transition.

What is Cloud Migration?

Cloud migration is a complex process that involves transferring an organization's digital resources, such as data, applications, and IT processes, from traditional on-premises infrastructure to cloud-based environments. This move to the cloud is often driven by the need for increased flexibility, scalability, and cost savings. To achieve a successful migration, organizations need to undertake a thorough planning process, carefully assess their current assets, and adopt appropriate strategies that ensure a smooth and efficient transition to the cloud.

Why should you migrate to the Cloud?

Migrating to the cloud provides numerous advantages for organizations, transforming their operations in multiple ways. The benefits of cloud migration can be summarized as follows:

Cost Efficiency:
By adopting a pay-as-you-go model, organizations can avoid high upfront capital expenses. Cloud providers handle maintenance and security, which reduces operational costs.
Scalability and Flexibility:
With on-demand scaling, organizations can prevent resource over-provisioning. This allows them to expand globally with minimal infrastructure investments.
Agility and Speed:
Cloud services enable swift provisioning, which means organizations can deploy applications faster, without worrying about infrastructure constraints. This fosters innovation.
Reliability and Security:
Cloud providers ensure high availability through robust redundancy and failover mechanisms. They also use strong encryption mechanisms for data protection.
Automatic Maintenance:
Cloud providers handle updates and security configurations seamlessly, which ensures hassle-free maintenance.
Collaboration and Accessibility:
Cloud services facilitate remote work by providing access to data and applications. Real-time collaboration tools also enhance teamwork.
Environmental Sustainability:
Cloud optimization helps reduce energy consumption, which aligns with environmental sustainability goals.
Competitive Edge:
By offloading infrastructure management, organizations can focus on their core competencies. This fosters innovation and competitiveness.

Migrating to the cloud is now a strategic necessity. It offers unparalleled benefits for organizations seeking agility, cost savings, and scalability in the modern business landscape.

Before you migrate

Before moving to the cloud, you need to understand your organization's current state and data architecture. This helps create a tailored migration strategy that optimizes cloud computing to meet your business's specific needs. Map out system complexities, dependencies, and application performance, and assess data volumes and storage requirements. A thorough inventory ensures a smooth transition to the cloud.

Migration Strategies.

Rehosting (Lift and Shift)
The "Lift and Shift" approach, also known as rehosting, is a popular migration strategy that involves moving existing applications and data from on-premises servers to the cloud without making significant changes to their architecture. This strategy is straightforward and low-risk, providing a quick way to migrate. However, it may not fully leverage the benefits of cloud-native features.
Replatforming (Lift, Tinker, and Shift)
Replatforming, also known as “Lift, Tinker, and Shift” is the process of making minor modifications to applications during cloud migration to optimize them for cloud environments. This approach aims to enhance performance, lower costs, and leverage cloud-specific services while limiting the requirement for a complete overhaul.
Refactoring (Re-architecting)
Refactoring or rearchitecting is a comprehensive strategy for organizations looking to maximize the benefits of the cloud. This involves redesigning applications to make the most of cloud-native features, such as microservices architecture, serverless computing, and managed services. While this strategy may be more time-consuming and complex, it can lead to improved scalability, resilience, and cost efficiency in the long run.
Repurchasing (Rebuy)
At times, it can be beneficial for organizations to replace their current applications with commercially available Software as a Service (SaaS) solutions. This approach, referred to as repurchasing or rebuying, enables organizations to delegate the responsibility of maintaining and updating certain applications while taking advantage of the scalability and accessibility of cloud-based SaaS offerings.
Retiring and Retaining
As a part of a migration strategy, organizations need to assess their application portfolio. Some applications may no longer be useful or have cloud-compatible alternatives, so they can be removed. Meanwhile, some applications that are vital to business operations should be retained and moved to the cloud to ensure continuous functionality and support.

Conclusion

To achieve a successful cloud migration, it is essential to have a well-defined and thoughtful strategy that suits the specific needs of the organization. Whether it involves a quick lift and shift or a more comprehensive rearchitecting, having a clear understanding of the available strategies is crucial for making informed decisions. By aligning migration efforts with business objectives and utilizing the right combination of strategies, organizations can unlock the full potential of the cloud, drive innovation, and maintain competitiveness in today's rapidly evolving digital landscape.

Cloudcode.ai can help you migrate to the cloud easily and efficiently. Give it a try to experience the magic!

The Treasure Map: Setting Up Your Ship​

The Mystical Inference at the /infer Endpoint​

Anchoring at Google Cloud Run​

View From the Analytical Lighthouse​

Cold Start​

Warm Start​

Future Plans​

The Strategy: Setting Boundaries and Reinforcing Instructions​

Step 1: Set the Stage​

Step 2: Reinforce the Instructions​

Step 3: Inject Artificial User Messages​

Conclusion​

What is Cloud Migration?​

Why should you migrate to the Cloud?​

Before you migrate​

Migration Strategies.​

Conclusion​

The Treasure Map: Setting Up Your Ship

The Mystical Inference at the /infer Endpoint

Anchoring at Google Cloud Run

View From the Analytical Lighthouse

Cold Start

Warm Start

Future Plans

The Strategy: Setting Boundaries and Reinforcing Instructions

Step 1: Set the Stage

Step 2: Reinforce the Instructions

Step 3: Inject Artificial User Messages

Conclusion

What is Cloud Migration?

Why should you migrate to the Cloud?

Before you migrate

Migration Strategies.

Conclusion