Type something to search...
Wren AI Text-to-SQL: API — the good stuff | by D | Medium

Wren AI Text-to-SQL: API — the good stuff | by D | Medium

Wren AI is an AI pipeline application that translates natural language user queries to SQL also known as Text-to-SQL (also known as Natural Language to SQL or NL2SQL), so you can chat with your database. It features both a UI and an API, allowing you to generate SQL queries based on user prompts and retrieve data using those queries. This significantly simplifies working with structured tabular data in your RAG/Agent applications.

Take a look at their demo https://demo.getwren.ai/

For more details on AI RAG and Agent pipeline applications, please see my previous article.

How Wren AI Works:

Deploying Wren AI is straightforward using the official Wren AI Docker Compose locally. You will need an OpenAI API key and to select the model you plan to use. Alternatively, you can use other OpenAI API-compatible inference engines like LocalAI by setting the OPENAI_API_BASE environment variable for the container. The Wren AI team has also implemented Ollama inference and recommends using Llama3 70b-instruct model. To utilize other than OpenAI models, you’ll need to create ~/.wrenai/.env.ai file following this example. Supported databases currently include BigQuery, DuckDB, PostgreSQL, MySQL, MS SQL, Clickhouse, and even Microsoft Excel Add-in. You can vote for additional database support [here].

### Example: Ollama inference
LLM_PROVIDER=ollama
### ollama. URL should be reachable from Docker Container!!!
OLLAMA_URL=http://host.docker.internal:11434
### https://ollama.com/library/llama3:70b-instruct-q8_0
GENERATION_MODEL=llama3:70b-instruct-q8_0
### supported embedding models providers by qdrant: https://qdrant.tech/documentation/embeddings/
### https://ollama.com/library/mxbai-embed-large:335m-v1-fp16
EMBEDDING_MODEL=mxbai-embed-large:335m-v1-fp16
EMBEDDING_MODEL_DIMENSION=1024
### DOCUMENT_STORE
DOCUMENT_STORE_PROVIDER=qdrant
QDRANT_HOST=qdrant

For Kubernetes deployment, refer to my article here:

Upon completing installing the Wren AI app you can access it via the URL.

Once connected to your database or after uploading a sample dataset, such as the NBA playground, you’ll enter the home page. If you’d like to see the data in the sample dataset you can find it here just replace the version in the URL with the DATASET_VERSION variable listed at the same location.

Configuring your schema and adding descriptive text is crucial in the Modeling menu.

Modeling aids both you and your Large Language Model (LLM) in understanding the data and metadata, resulting in improved query outcomes. To provide better results you must setup relations and provide descriptions since API names of the columns in your tables might not be enough.

With your schema with relations configured and descriptions added in the Modeling menu, head to the Home menu to generate your first Text-to-SQL queries. The system attempts to produce three different SQL queries for you, with the results visible.

The good stuff: API

Moreover, you can access the same functionality via an API, making it an ideal platform for integrating structured tabular data into your RAG/Agent pipeline applications simply by consuming Wren AI as a service. Unfortunately, currently, there’s no mechanism for sending real-time updates from Wren AI back to the client such as WebSocket, Server-Sent-Events, or WebHooks. You’ll have to use Long Polling initiated by the client to constantly check for updates if Wren AI gets the answers ready for you, but the team is working on it.

While there isn’t currently public API documentation available, you can explore the functionality through the following steps by utilizing the Chrome Dev Tools -> Network to observe API interactions.

1. Create an asynchronous task by submitting a question to AI using the `createAskingTask` mutation in GraphQL.

2. Poll the task’s status change using the `askingTask` query.

3. Upon receiving a `FINISHED` status, retrieve three candidates in the payload.

4. Call the `createThread` mutation with the payload obtained.

5. A thread comprises multiple responses, each containing a question and an answer. Query a thread with the `thread` query.

6. The `responses` field in each thread contains an array of responses. Subsequent questions append to this array.

7. Continuously poll the `threadResponse` API to monitor status changes.

8. Upon the status turning `FINISHED`, view the detailed answer in the `detail` field.

9. The `detail` field provides a step-by-step answer seen in the UI.

Currently, Wren AI doesn’t offer a direct way to download or export results with data. As a workaround, consider copying the native SQL, executable within your database, and perform exporting from the database itself.

Summary

Wren AI platform makes it extremely easy for you or your RAG/Agent AI application to work with structured data in your Relational Databases, converting user prompts into valid SQL. UI with a modeling menu is important to provide additional metadata descriptions and schemas for your LLM to better understand your database and its data. Once modeling is done, Wren AI is ready to serve as a middleman in front of your DB. In short, you just speak to your database. This is unheard-of simplicity unachievable before, which makes work with DB highly effective in translating business language into SQL queries directly.

Enjoyed This Story?

If you like this article and you want to support me:

  1. Clap 👏 my article 10 times; that will help me out
  2. Follow me on Medium to get my latest articles 🫶
  3. Follow Wren AI on Medium
  4. Share this article on social media ➡️🌐
  5. Give me feedback in the comments 💬 below. It’ll help me to better understand that this work was useful, even a simple “thanks” or “+” will do. Give me good, give me bad, whatever you think as long as you tell me place to improve and how.

Related Posts

10 Creative Ways to Use ChatGPT Search The Web Feature

10 Creative Ways to Use ChatGPT Search The Web Feature

For example, prompts and outputs Did you know you can use the “search the web” feature of ChatGPT for many tasks other than your basic web search? For those who don't know, ChatGPT’s new

Read More
📚 10 Must-Learn Skills to Stay Ahead in AI and Tech 🚀

📚 10 Must-Learn Skills to Stay Ahead in AI and Tech 🚀

In an industry as dynamic as AI and tech, staying ahead means constantly upgrading your skills. Whether you’re aiming to dive deep into AI model performance, master data analysis, or transform trad

Read More
10 Powerful Perplexity AI Prompts to Automate Your Marketing Tasks

10 Powerful Perplexity AI Prompts to Automate Your Marketing Tasks

In today’s fast-paced digital world, marketers are always looking for smarter ways to streamline their efforts. Imagine having a personal assistant who can create audience profiles, suggest mar

Read More
10+ Top ChatGPT Prompts for UI/UX Designers

10+ Top ChatGPT Prompts for UI/UX Designers

AI technologies, such as machine learning, natural language processing, and data analytics, are redefining traditional design methodologies. From automating repetitive tasks to enabling personal

Read More
100 AI Tools to Finish Months of Work in Minutes

100 AI Tools to Finish Months of Work in Minutes

The rapid advancements in artificial intelligence (AI) have transformed how businesses operate, allowing people to complete tasks that once took weeks or months in mere minutes. From content creat

Read More
17 Mindblowing GitHub Repositories You Never Knew Existed

17 Mindblowing GitHub Repositories You Never Knew Existed

Github Hidden Gems!! Repositories To Bookmark Right Away Learning to code is relatively easy, but mastering the art of writing better code is much tougher. GitHub serves as a treasur

Read More