Articles tagged with "AWS"

Embedded Embeddings Database: Building a low cost serverless RAG solution

Retrieval-Augmented Generation (RAG) solutions are an impressive way to talk to one’s data. One of the challenges of RAG solutions is the associated cost, often driven by the vector database. In a previous blog article I presented how to tackle this issue by using Athena with Locality Sensitive Hashing (LSH) as a knowledge database. One the of the main limitations with Athena is the latency and the low number of concurrent queries. In this new blog article, I present a new low-cost serverless solution that makes use of an embedded vector database, SQLite, to achieve a low cost while maintaining high concurrency.

Advanced API Gateway Throttling: Time-Based Adjustments, Budget Alerts, and WAF Configured with Terraform

In the previous post, “API Gateway and Lambda Throttling with Terraform”, we covered the basics of setting up throttling for your API Gateway and Lambda functions. In this follow-up, we’ll take it to the next level, adding budget controls, time-based throttling adjustments, and AWS WAF security integration to safeguard your API while optimizing both performance and cost-efficiency.

API Gateway and Lambda Throttling with Terraform: A Comprehensive Guide

In today’s cloud-native world, effectively managing API and serverless function performance is crucial for building reliable and cost-effective applications. This guide explores advanced throttling techniques for AWS API Gateway and Lambda using Terraform, incorporating best practices from the AWS Well-Architected Framework and real-world implementation patterns.

Comprehensive Guide to AWS AI/ML Services: The Ultimate Decision Maker’s Playbook

Looking to integrate artificial intelligence and machine learning into your business strategy? AWS has an ever-growing suite of AI/ML services, which can sometimes feel overwhelming to navigate. But don’t worry—this playbook is designed to provide a clear, simple guide to help you find the right tools for your business. Explore how AWS AI/ML services can empower your business and simplify your processes—no expert knowledge needed!

Regaining Amazon QuickSight SPICE capacity

When using Business Intelligence solutions, it can quickly become difficult to manage all the resources created by numerous users. In this blog article, we will see how the Amazon QuickSight API was leveraged to determine which resources used up all the available SPICE capacity in our Amazon QuickSight account.

OpenSearch vs. Elasticsearch: Why OpenSearch is the Better Choice for AWS Users in 2024

In the dynamic landscape of search and analytics engines, AWS users often find themselves weighing the merits of OpenSearch against Elasticsearch. While both platforms offer robust search capabilities, Amazon OpenSearch Service emerges as the superior option for those operating within the AWS ecosystem. This comprehensive analysis delves into the reasons why OpenSearch stands out and how it compares to Elasticsearch, particularly for AWS users.