[{"title":"Improved Browser Testing on Heroku with Chrome","content":"\u003cp\u003eFor developers and businesses offering a web-based product, automated browser testing is a critical tool to ensure continuous delivery of a reliable service. Developers write browser tests by scripting actions against a real browser, simulating real usage by navigating, selecting, and making assertions about web pages and their document elements. \u003c/p\u003e\n\n\u003cp\u003eIn this post, we introduce a new community buildpack that helps with automated browser testing. The new buildpack resolves installation reliability problems in the existing Chrome browser buildpacks for Heroku apps.\u003c/p\u003e\n\n\u003c!-- more --\u003e\n\u003ch2 class='anchored'\u003e\n  \u003ca name='browser-testing-on-heroku' href='#browser-testing-on-heroku'\u003eBrowser Testing on Heroku\u003c/a\u003e\n\u003c/h2\u003e\n\n\u003cp\u003eDevelopers can manually run browser tests on their machines to support writing and debugging tests. They can automate browser tests with continuous integration tools like \u003ca href=\"https://devcenter.heroku.com/articles/heroku-ci-browser-and-user-acceptance-testing-uat\"\u003eHeroku CI\u003c/a\u003e to run in response to code updates and catch new problems on feature branches before they’re merged and released. They can also automate browser tests with a continuous end-to-end testing service. For example, running the test suite every hour to catch new problems with a customer-facing app.\u003c/p\u003e\n\n\u003cp\u003eAt Heroku, we use automated browser testing to ensure the reliability of the \u003ca href=\"https://dashboard.heroku.com/\"\u003eHeroku Dashboard\u003c/a\u003e, our primary web interface. Continuous testing of the dashboard and related interfaces throughout their lifecycle, from feature development to monitoring the production system, is essential for early bug detection, quality assurance, and adaptability. 
\u003c/p\u003e\n\n\u003cp\u003eHeroku engineers found one long-standing issue regularly disrupts browser testing. Occasionally, automated Chrome browser tests all fail due to a version mismatch of the installed Chrome and Chromedriver components, like this example error message:\u003c/p\u003e\n\n\u003cpre\u003e\u003ccode\u003eThis version of ChromeDriver only supports Chrome version N\nCurrent browser version is M\n\u003c/code\u003e\u003c/pre\u003e\n\n\u003cp\u003eWhile it seems like the answer is to set a specific version number, Chrome is an evergreen browser. The browser continuously refreshes itself with security updates and features. Setting a specific version is discouraged because the browser quickly falls out of date.\u003c/p\u003e\n\u003ch2 class='anchored'\u003e\n  \u003ca name='introducing-a-new-community-buildpack' href='#introducing-a-new-community-buildpack'\u003eIntroducing A New Community Buildpack\u003c/a\u003e\n\u003c/h2\u003e\n\n\u003cp\u003eTo solve this cycle of version mismatches as Chrome updates itself, we created the \u003ca href=\"https://elements.heroku.com/buildpacks/heroku/heroku-buildpack-chrome-for-testing\"\u003eChrome for Testing Heroku Buildpack\u003c/a\u003e. 
We were able to release this buildpack because the Chrome development team \u003ca href=\"https://developer.chrome.com/blog/chrome-for-testing/\"\u003eaddressed the long-standing problem of keeping Chrome and Chromedriver versions updated and aligned\u003c/a\u003e with each other for automated testing environments.\u003c/p\u003e\n\n\u003cp\u003e\u003cstrong\u003eTo use this new Chrome for Testing buildpack in Heroku apps, head over to the Heroku Elements Marketplace and install the \u003ca href=\"https://elements.heroku.com/buildpacks/heroku/heroku-buildpack-chrome-for-testing\"\u003eChrome for Testing Heroku Buildpack\u003c/a\u003e.\u003c/strong\u003e\u003c/p\u003e\n\n\u003cp\u003eIf the app is already using Chrome, make sure to remove existing Chrome and Chromedriver buildpacks before installing the new buildpack. To install Chrome for Testing on an app, add \u003ccode\u003eheroku-community/chrome-for-testing\u003c/code\u003e as the first buildpack:\u003c/p\u003e\n\n\u003cpre\u003e\u003ccode\u003eheroku buildpacks:add -i 1 heroku-community/chrome-for-testing\n\u003c/code\u003e\u003c/pre\u003e\n\n\u003cp\u003eBy default, this buildpack downloads the latest \u003ccode\u003eStable\u003c/code\u003e release, which \u003ca href=\"https://googlechromelabs.github.io/chrome-for-testing/\"\u003eGoogle provides\u003c/a\u003e. 
You can control the channel of the release by setting the app’s \u003ccode\u003eGOOGLE_CHROME_CHANNEL\u003c/code\u003e config variable to \u003ccode\u003eStable\u003c/code\u003e, \u003ccode\u003eBeta\u003c/code\u003e, \u003ccode\u003eDev\u003c/code\u003e, or \u003ccode\u003eCanary\u003c/code\u003e, and then deploy and build the app.\u003c/p\u003e\n\n\u003cp\u003eAfter the app deploys with the Chrome for Testing buildpack, \u003ccode\u003echrome\u003c/code\u003e and \u003ccode\u003echromedriver\u003c/code\u003e executables are installed on the \u003ccode\u003ePATH\u003c/code\u003e in dynos, available for browser automation tools like \u003ca href=\"https://www.selenium.dev/documentation/webdriver/\"\u003eSelenium WebDriver\u003c/a\u003e and \u003ca href=\"https://pptr.dev/\"\u003ePuppeteer\u003c/a\u003e. We welcome feedback about this buildpack on its \u003ca href=\"https://github.com/heroku/heroku-buildpack-chrome-for-testing\"\u003eGitHub repository\u003c/a\u003e. Happy testing!\u003c/p\u003e\n","published_at":"2024-04-09T17:00:53.859Z","permalink":"https://blog.heroku.com/improved-browser-testing-on-heroku-with-chrome","tags":["testing","Heroku CI","continuous integration","buildpack"],"summary":"Introducing a new community buildpack for automated browser testing with Chrome on Heroku."},{"title":"Building a GPT Backed by a Heroku-Deployed API","content":"\u003ch2 class='anchored'\u003e\n  \u003ca name='how-to-connect-your-gpt-on-openai-to-a-backend-node-js-app' href='#how-to-connect-your-gpt-on-openai-to-a-backend-node-js-app'\u003eHow to connect your GPT on OpenAI to a backend Node.js app\u003c/a\u003e\n\u003c/h2\u003e\n\n\u003cp\u003eLate in 2023, \u003ca href=\"https://openai.com/blog/introducing-gpts\"\u003eOpenAI introduced GPTs\u003c/a\u003e, a way for developers to build customized versions of ChatGPT that can bundle in specialized knowledge, follow preset instructions, or perform actions like reaching out to external APIs.  
As more and more businesses and individuals use ChatGPT, developers are racing to build powerful GPTs to ride the wave of ChatGPT adoption.\u003c/p\u003e\n\n\u003c!-- more --\u003e\n\n\u003cp\u003e\u003cimg src=\"https://heroku-blog-files.s3.amazonaws.com/posts/1711485326-image10.png\" alt=\"Introducing GPTs\"\u003e\u003c/p\u003e\n\n\u003cp\u003e\u003ca href=\"https://openai.com/blog/introducing-gpts\"\u003eSource\u003c/a\u003e\u003c/p\u003e\n\n\u003cp\u003eIf you’re thinking about diving into GPT development, we’ve got some good news: Building a powerful GPT mostly involves building an API that handles a few endpoints. And in this post, we’ll show you how to do it.\u003c/p\u003e\n\n\u003cp\u003eIn this walk-through, we’ll build a simple API server with Node.js. We’ll deploy our API to Heroku for simplicity and security. Then, we’ll show you how to create and configure a GPT that reaches out to your API. This project is part of our \u003ca href=\"https://github.com/heroku-reference-apps\"\u003eHeroku Reference Applications\u003c/a\u003e GitHub organization where we host different projects showcasing architectures and patterns to deploy to Heroku.\u003c/p\u003e\n\n\u003cp\u003eThis is going to be a fun one. Let’s do it!\u003c/p\u003e\n\u003ch2 class='anchored'\u003e\n  \u003ca name='our-gpt-an-employee-directory' href='#our-gpt-an-employee-directory'\u003eOur GPT: An Employee Directory\u003c/a\u003e\n\u003c/h2\u003e\n\n\u003cp\u003eImagine your organization uses ChatGPT internally for some of its operations. You want to provide your users (employees) with a convenient way to search through the employee database. These users aren’t tech-savvy. What’s an SQL query anyway?\u003c/p\u003e\n\n\u003cp\u003eWith natural language, our users will ask our custom GPT a question about employees in the company. 
For example, they might ask: “\u003cem\u003eWho do we have in the marketing department that was hired in 2021?\u003c/em\u003e”\u003c/p\u003e\n\n\u003cp\u003eThe end user doesn’t know (or care) about databases, queries, or result rows. Our GPT will send a request to our API. Our API will find the requested information and return a natural language response, which our GPT sends back to the end user.\u003c/p\u003e\n\n\u003cp\u003eHere’s how it looks:\u003c/p\u003e\n\n\u003cp\u003e\u003cimg src=\"https://heroku-blog-files.s3.amazonaws.com/posts/1711485416-image6.png\" alt=\"GPT example\"\u003e\u003c/p\u003e\n\n\u003cp\u003ePretty cool, right? The basic flow looks like this:\u003c/p\u003e\n\n\u003col\u003e\n\u003cli\u003eIn the ChatGPT interface, the user asks our GPT a question related to the employee directory.\u003c/li\u003e\n\u003cli\u003eThe GPT sends a POST request containing the user’s question to our API.\u003c/li\u003e\n\u003cli\u003eOur API calls OpenAI’s Chat Completions API to help translate the user’s question into a well-formed SQL query.\u003c/li\u003e\n\u003cli\u003eOur API uses the SQL query to fetch results from the employee database.\u003c/li\u003e\n\u003cli\u003eOur API calls OpenAI’s Chat Completions API to process the query results into a natural language response.\u003c/li\u003e\n\u003cli\u003eOur API passes this response back to the GPT.\u003c/li\u003e\n\u003cli\u003eChatGPT presents the response to the user.\u003c/li\u003e\n\u003c/ol\u003e\n\n\u003cp\u003e\u003cimg src=\"https://heroku-blog-files.s3.amazonaws.com/posts/1711485480-image4.png\" alt=\"Architecture diagram\"\u003e\u003c/p\u003e\n\n\u003cp\u003e\u003cstrong\u003eNote\u003c/strong\u003e: In the architecture above, all the data leaves the Heroku trust boundary to access OpenAI services. Take this into account when building data-sensitive applications. 
\u003c/p\u003e\n\u003ch2 class='anchored'\u003e\n  \u003ca name='prerequisites-and-initial-steps' href='#prerequisites-and-initial-steps'\u003ePrerequisites and Initial Steps\u003c/a\u003e\n\u003c/h2\u003e\n\n\u003cp\u003e\u003cstrong\u003eNote\u003c/strong\u003e: If you want to try the application first, deploy it using the “Deploy to Heroku” button in the reference application’s \u003ca href=\"https://github.com/heroku-reference-apps/employee-directory-gpt-action/blob/main/README.md\"\u003eREADME\u003c/a\u003e file.\u003c/p\u003e\n\n\u003cp\u003eBefore you can get started, you’ll need a few things in place:\u003c/p\u003e\n\n\u003col\u003e\n\u003cli\u003eAn \u003ca href=\"https://openai.com/\"\u003eOpenAI account\u003c/a\u003e. You’ll need to add a payment method and purchase a small amount of credit since you want access to its APIs.\u003c/li\u003e\n\u003cli\u003eOnce you have your OpenAI account set up, you’ll need to create a \u003ca href=\"https://platform.openai.com/api-keys\"\u003esecret API key\u003c/a\u003e and copy it down. Your API application will need this key to authenticate its requests to the OpenAI API.\u003c/li\u003e\n\u003cli\u003eA \u003ca href=\"https://signup.heroku.com/\"\u003eHeroku account\u003c/a\u003e. You’ll need to add a payment method to cover your compute and database costs. For building and testing this API, we recommend using an \u003ca href=\"https://devcenter.heroku.com/articles/eco-dyno-hours\"\u003eeco dyno\u003c/a\u003e, which has a $5 monthly flat fee. It’ll supply you with more than enough hours for initial development. You’ll also need \u003ca href=\"https://devcenter.heroku.com/articles/heroku-postgresql\"\u003eHeroku Postgres\u003c/a\u003e. You can use the Mini plan, at $0.007/hour, which is enough for this application.\u003c/li\u003e\n\u003cli\u003eA \u003ca href=\"https://github.com/\"\u003eGitHub account\u003c/a\u003e for your code repository. 
Heroku will hook into your GitHub repo directly, simplifying deployment to a single click.\u003c/li\u003e\n\u003cli\u003eClone the \u003ca href=\"https://github.com/heroku-reference-apps/employee-directory-chatgpt-plugin\"\u003eGitHub repo\u003c/a\u003e with the code for the API application.\u003c/li\u003e\n\u003c/ol\u003e\n\n\u003cp\u003e\u003cstrong\u003eNote\u003c/strong\u003e: Every request incurs costs, and the price varies depending on the selected model. For example, using the GPT-3 model, in order to spend $1, you\u0026#39;d have to ask more than 20,000 questions. See the \u003ca href=\"https://openai.com/pricing\"\u003eOpenAI API pricing\u003c/a\u003e page for more information.\u003c/p\u003e\n\n\u003cp\u003eThe README in the repo has all the instructions you need to get the API server deployed to Heroku. If you just want to get your GPT up and running quickly, skip down to the \u003cstrong\u003eCreate and Configure GPT\u003c/strong\u003e section. Otherwise, you can follow along to walk through how to build this API.\u003c/p\u003e\n\n\u003cp\u003eWe used Node \u003ccode\u003ev20.10.0\u003c/code\u003e and \u003ccode\u003eyarn\u003c/code\u003e as our package manager. Install your dependencies.\u003c/p\u003e\n\n\u003cpre\u003e\u003ccode class=\"sh\"\u003eyarn install\n\u003c/code\u003e\u003c/pre\u003e\n\u003ch2 class='anchored'\u003e\n  \u003ca name='build-the-api' href='#build-the-api'\u003eBuild the API\u003c/a\u003e\n\u003c/h2\u003e\n\n\u003cp\u003eOne of the most powerful ways to use OpenAI’s custom GPTs is by building an API that your GPT reaches out to. Here’s how OpenAI’s \u003ca href=\"https://openai.com/blog/introducing-gpts\"\u003eblog post introducing GPTs\u003c/a\u003e describes it:\u003c/p\u003e\n\n\u003cp\u003eIn addition to using our built-in capabilities, you can also define custom actions by making one or more APIs available to the GPT… Connect GPTs to databases, plug them into emails, or make them your shopping assistant. 
For example, you could integrate a travel listings database, connect a user’s email inbox, or facilitate e-commerce orders.\u003c/p\u003e\n\n\u003cp\u003eSo, even though we’re building a GPT, under the hood we are simply building an API. For this, we use \u003ca href=\"https://expressjs.com/\"\u003eExpress\u003c/a\u003e and listen for POST requests to the \u003ccode\u003e/search\u003c/code\u003e endpoint. We can build and test our API as a standalone unit before creating our GPT and custom action.\u003c/p\u003e\n\n\u003cp\u003eLet’s look at \u003ccode\u003esrc/index.js\u003c/code\u003e for how our server will handle POST requests to \u003ccode\u003e/search\u003c/code\u003e. To keep our code snippet easily readable, we’ve left out the logging and error handling:\u003c/p\u003e\n\n\u003cpre\u003e\u003ccode class=\"language-javascript\"\u003eserver.post(\u0026#39;/\u0026#39;, authMiddleware, async (req, res) =\u0026gt; {\n  …\n  const userPrompt = req.body.message\n  const sql = await AI.craftQuery(userPrompt)\n  let rows = []\n  …\n  rows = await db.query(sql)\n  …\n  const results = await AI.processResult(userPrompt, sql, rows)\n  res.send(results)\n})\n\u003c/code\u003e\u003c/pre\u003e\n\n\u003cp\u003eAs you can see, the major steps we need to cover are:\u003c/p\u003e\n\n\u003col\u003e\n\u003cli\u003eAsk OpenAI to craft an SQL query.\u003c/li\u003e\n\u003cli\u003eQuery the database.\u003c/li\u003e\n\u003cli\u003eAsk OpenAI to turn the query results into a natural language response.\u003c/li\u003e\n\u003c/ol\u003e\n\u003ch3 class='anchored'\u003e\n  \u003ca name='using-openai-s-chat-completions-api' href='#using-openai-s-chat-completions-api'\u003eUsing OpenAI’s Chat Completions API\u003c/a\u003e\n\u003c/h3\u003e\n\n\u003cp\u003eBecause our API will need to do some natural language processing, it will make some calls to OpenAI’s Chat Completions API. Not every API needs to do this. Imagine a simple API that just needs to return the current date and time. 
It doesn’t need to rely on OpenAI for its business logic.\u003c/p\u003e\n\n\u003cp\u003eBut \u003cem\u003eour\u003c/em\u003e GPT’s supporting API will need the Chat Completions API for basic \u003ca href=\"https://platform.openai.com/docs/guides/text-generation\"\u003etext generation\u003c/a\u003e.\u003c/p\u003e\n\u003ch3 class='anchored'\u003e\n  \u003ca name='the-first-call-to-openai-generate-an-sql-query' href='#the-first-call-to-openai-generate-an-sql-query'\u003eThe first call to OpenAI: generate an SQL query\u003c/a\u003e\n\u003c/h3\u003e\n\n\u003cp\u003eAs per our flow (see the diagram above), we’ll need to ask OpenAI to convert the user’s original question into an SQL query. Let’s look at \u003ccode\u003esrc/ai.js\u003c/code\u003e to see how we do this.\u003c/p\u003e\n\n\u003cp\u003eWhen sending a request to the Chat Completions API, we send an array of messages to help ChatGPT understand the context, including what’s being requested and how we want ChatGPT to behave in its response. 
Our first message is a \u003ccode\u003esystem\u003c/code\u003e message, where we set the stage for ChatGPT.\u003c/p\u003e\n\n\u003cpre\u003e\u003ccode class=\"language-javascript\"\u003econst PROMPT = `\n  I have a psql db with an \u0026quot;employees\u0026quot; table, created with the following statements:\n\n  create type department_enum as enum(\u0026#39;Accounting\u0026#39;,\u0026#39;Sales\u0026#39;,\u0026#39;Engineering\u0026#39;,\u0026#39;Marketing\u0026#39;,\u0026#39;Product\u0026#39;,\u0026#39;Customer Service\u0026#39;,\u0026#39;HR\u0026#39;);\n\n  create type title_enum as enum(\u0026#39;Assistant\u0026#39;, \u0026#39;Manager\u0026#39;, \u0026#39;Junior Executive\u0026#39;, \u0026#39;President\u0026#39;, \u0026#39;Vice-President\u0026#39;, \u0026#39;Associate\u0026#39;, \u0026#39;Intern\u0026#39;, \u0026#39;Contractor\u0026#39;);\n\n  create table employees(id char(36) not null unique primary key, first_name varchar(64) not null, last_name varchar(64) not null, email text not null, department department_enum not null, title title_enum not null, hire_date date not null);\n`.trim()\n\nconst SYSTEM_MESSAGE = { role: \u0026#39;system\u0026#39;, content: PROMPT }\n\u003c/code\u003e\u003c/pre\u003e\n\n\u003cp\u003eOur \u003ccode\u003ecraftQuery\u003c/code\u003e function looks like this:\u003c/p\u003e\n\n\u003cpre\u003e\u003ccode class=\"language-javascript\"\u003econst craftQuery = async (userPrompt) =\u0026gt; {\n  const settings = {\n    messages: [SYSTEM_MESSAGE],\n    model: CHATGPT_MODEL,\n    temperature: TEMPERATURE,\n    response_format: {\n    type: \u0026#39;json_object\u0026#39;\n    }\n  }\n\n  settings.messages.push({\n    role: \u0026#39;system\u0026#39;,\n    content: \u0026#39;Output JSON with the query under the \u0026quot;sql\u0026quot; key.\u0026#39;\n  })\n\n  settings.messages.push({\n    role: \u0026#39;user\u0026#39;,\n    content: userPrompt\n  })\n  settings.messages.push({\n    role: \u0026#39;user\u0026#39;,\n    content: 
\u0026#39;Provide a single SQL query to obtain the desired result.\u0026#39;\n  })\n\n  logger.info(\u0026#39;craftQuery sending request to openAI\u0026#39;)\n\n  const response = await openai.chat.completions.create(settings)\n  const content = JSON.parse(response.choices[0].message.content)\n  return content.sql\n}\n\u003c/code\u003e\u003c/pre\u003e\n\n\u003cp\u003eLet’s walk through what this code does in detail. First, we put together the set of messages that we’ll send to ChatGPT:\u003c/p\u003e\n\n\u003col\u003e\n\u003cli\u003eThe initial \u003ccode\u003esystem\u003c/code\u003e message that lays out how we have structured our database, so that ChatGPT knows column names and constraints when crafting a query.\u003c/li\u003e\n\u003cli\u003eA \u003ccode\u003esystem\u003c/code\u003e message that tells ChatGPT the format/structure we want for the response. In this case, we want the response as JSON (not natural language), with the SQL query under the key called \u003ccode\u003esql\u003c/code\u003e.\u003c/li\u003e\n\u003cli\u003eA \u003ccode\u003euser\u003c/code\u003e message, which is the end user’s original request.\u003c/li\u003e\n\u003cli\u003eA follow-up \u003ccode\u003euser\u003c/code\u003e message, where we specifically ask ChatGPT to generate a single SQL query for us, based on what we’re looking for.\u003c/li\u003e\n\u003c/ol\u003e\n\n\u003cp\u003eWe use the \u003ccode\u003e\u003ca href=\"https://www.npmjs.com/package/openai\"\u003eopenai\u003c/a\u003e\u003c/code\u003e package (not shown) for Node.js. This is the official JavaScript library for OpenAI, serving as a convenient wrapper around the OpenAI API. With our \u003ccode\u003esettings\u003c/code\u003e in place, we call the \u003ccode\u003e\u003ca href=\"https://platform.openai.com/docs/api-reference/chat/create\"\u003ecreate\u003c/a\u003e\u003c/code\u003e function to generate a response. 
Then, we return the \u003ccode\u003esql\u003c/code\u003e statement (in the JSON object) from OpenAI’s response.\u003c/p\u003e\n\u003ch3 class='anchored'\u003e\n  \u003ca name='use-sql-to-query-the-database' href='#use-sql-to-query-the-database'\u003eUse SQL to query the database\u003c/a\u003e\n\u003c/h3\u003e\n\n\u003cp\u003eBack in \u003ccode\u003esrc/index.js\u003c/code\u003e, we use the SQL statement from OpenAI to query our database. We wrote a small module (\u003ccode\u003esrc/db.js\u003c/code\u003e) to handle connecting with our PostgreSQL database and sending queries.\u003c/p\u003e\n\n\u003cp\u003eOur call to \u003ccode\u003edb.query(sql)\u003c/code\u003e returns the query result, an array called \u003ccode\u003erows\u003c/code\u003e.\u003c/p\u003e\n\u003ch3 class='anchored'\u003e\n  \u003ca name='the-second-call-to-openai-process-the-query-results' href='#the-second-call-to-openai-process-the-query-results'\u003eThe second call to OpenAI: process the query results\u003c/a\u003e\n\u003c/h3\u003e\n\n\u003cp\u003eAlthough our API \u003cem\u003ecould\u003c/em\u003e send back the raw database query results to the end user, it would be a better user experience if we turned those results into a human-readable response. Our user doesn’t need to know that there was a database involved. A natural language response would be ideal.\u003c/p\u003e\n\n\u003cp\u003eSo, we’ll send another request to the Chat Completions API. 
In \u003ccode\u003esrc/ai.js\u003c/code\u003e, we have a function called \u003ccode\u003eprocessResult\u003c/code\u003e:\u003c/p\u003e\n\n\u003cpre\u003e\u003ccode class=\"language-javascript\"\u003econst processResult = async (userPrompt, sql, rows) =\u0026gt; {\n  const settings = {\n    messages: [SYSTEM_MESSAGE],\n    model: CHATGPT_MODEL,\n    temperature: TEMPERATURE\n  }\n\n  const userMessage = `\n    This is how I described I was looking for: ${userPrompt}\n\n    This is the query sent to find the results: ${sql}\n\n    Here is the resulting data that you found:\n    ${JSON.stringify(rows)}\n\nAssume I am not even aware that a database query was run. Do not include the SQL query in your response to me. If the original request does not explicitly specify a sort order, then sort the results in the most natural way. Return the resulting data to me in a human-readable way, not as an object or an array. Keep your response direct. Tell me what you found and how it is sorted.\u0026#39;\n  `\n  settings.messages.push({\n    role: \u0026#39;user\u0026#39;,\n    content: userMessage\n  })\n\n  logger.info(\u0026#39;processResult sending request to openAI\u0026#39;)\n\n  const response = await openai.chat.completions.create(settings)\n  return response.choices[0].message.content\n}\n\u003c/code\u003e\u003c/pre\u003e\n\n\u003cp\u003eAgain, we start with an initial \u003ccode\u003esystem\u003c/code\u003e message that gives ChatGPT information about our database. At this point, you might ask: \u003cem\u003eDidn’t we already do that? 
Why do we need to tell ChatGPT about our database structure \u003cspan style=\"text-decoration:underline;\"\u003eagain\u003c/span\u003e?\u003c/em\u003e The answer is in the \u003ca href=\"https://platform.openai.com/docs/guides/text-generation/chat-completions-api\"\u003eChat Completions API documentation\u003c/a\u003e:\u003c/p\u003e\n\n\u003cp\u003e\u003cem\u003eIncluding conversation history is important when user instructions refer to prior messages…. Because the models have no memory of past requests, all relevant information must be supplied as part of the conversation history in \u003cstrong\u003eeach\u003c/strong\u003e request.\u003c/em\u003e\u003c/p\u003e\n\n\u003cp\u003eAlong with the database structure, we want to provide ChatGPT with some more context. In \u003ccode\u003euserMessage\u003c/code\u003e, we include:\u003c/p\u003e\n\n\u003col\u003e\n\u003cli\u003eThe user’s original question (\u003ccode\u003euserPrompt\u003c/code\u003e), so ChatGPT knows what question it is ultimately answering.\u003c/li\u003e\n\u003cli\u003eThe \u003ccode\u003esql\u003c/code\u003e query that we used to fetch the results from the database.\u003c/li\u003e\n\u003cli\u003eThe database query results (\u003ccode\u003erows\u003c/code\u003e).\u003c/li\u003e\n\u003cli\u003eClear instructions about what we want ChatGPT to do now—that is, “\u003cem\u003ereturn the resulting data to me in a human-readable way\u003c/em\u003e” (along with some other guidelines).\u003c/li\u003e\n\u003c/ol\u003e\n\n\u003cp\u003eSimilar to before, we send these \u003ccode\u003esettings\u003c/code\u003e to the \u003ccode\u003ecreate\u003c/code\u003e function, and then pass the response content up to the caller.\u003c/p\u003e\n\u003ch3 class='anchored'\u003e\n  \u003ca name='other-implementation-details-not-shown' href='#other-implementation-details-not-shown'\u003eOther implementation details (not shown)\u003c/a\u003e\n\u003c/h3\u003e\n\n\u003cp\u003eThe code snippets we’ve shown cover the major 
implementation details for our API development. You can always take a look at the \u003ca href=\"https://github.com/heroku-reference-apps/employee-directory-gpt-action\"\u003eGitHub repo\u003c/a\u003e to see all the code, line by line. Some details that we didn’t cover here are:\u003c/p\u003e\n\n\u003cul\u003e\n\u003cli\u003eCreating a PostgreSQL database with an \u003ccode\u003eemployees\u003c/code\u003e table and populating it with dummy data. See the \u003ccode\u003edata/create_schema.sql\u003c/code\u003e and \u003ccode\u003edata/create_records.sql\u003c/code\u003e for this.\u003c/li\u003e\n\u003cli\u003eImplementing bearer auth for our API (see \u003ccode\u003esrc/auth.js\u003c/code\u003e). Requests to our API must attach an API key that we generate. We store this API key as an environment variable called \u003ccode\u003eBEARER_AUTH_API_KEY\u003c/code\u003e. We’ll discuss this lower down when configuring our GPT.\u003c/li\u003e\n\u003cli\u003eWriting basic unit tests with \u003ca href=\"https://jestjs.io/\"\u003eJest\u003c/a\u003e.\u003c/li\u003e\n\u003cli\u003e\u003ca href=\"https://eslint.org/\"\u003eESLint\u003c/a\u003e and \u003ca href=\"https://prettier.io/\"\u003ePrettier\u003c/a\u003e configurations to keep our code clean and readable.\u003c/li\u003e\n\u003c/ul\u003e\n\u003ch3 class='anchored'\u003e\n  \u003ca name='testing-our-api-s-business-logic' href='#testing-our-api-s-business-logic'\u003eTesting our API’s business logic\u003c/a\u003e\n\u003c/h3\u003e\n\n\u003cp\u003eWith all of our code in place, we can test our API by sending a POST request, just like our GPT would send a request when a user makes a query. 
When we start our server locally, we make sure to have a \u003ccode\u003e.env\u003c/code\u003e file that contains the environment variables that our API will need:\u003c/p\u003e\n\n\u003cul\u003e\n\u003cli\u003e\u003ccode\u003eOPENAI_API_KEY\u003c/code\u003e: The \u003ccode\u003eopenai\u003c/code\u003e JavaScript package uses this to authenticate requests we send to the Chat Completions API.\u003c/li\u003e\n\u003cli\u003e\u003ccode\u003eBEARER_AUTH_API_KEY\u003c/code\u003e: This is the API key that a caller of \u003cem\u003eour\u003c/em\u003e API will need to provide for authentication.\u003c/li\u003e\n\u003cli\u003e\u003ccode\u003eDATABASE_URL\u003c/code\u003e: The PostgreSQL connection string for our database.\u003c/li\u003e\n\u003c/ul\u003e\n\n\u003cp\u003eAn example \u003ccode\u003e.env\u003c/code\u003e file might look like this:\u003c/p\u003e\n\n\u003cpre\u003e\u003ccode class=\"txt\"\u003eOPENAI_API_KEY=sk-Kie************************************************\nBEARER_AUTH_API_KEY=thisismysecretAPIkey\nDATABASE_URL=postgres://db_user:db_pass@localhost:5432/company_hr_db\n\u003c/code\u003e\u003c/pre\u003e\n\n\u003cp\u003eWe start our server:\u003c/p\u003e\n\n\u003cpre\u003e\u003ccode class=\"sh\"\u003enode index.js\n\u003c/code\u003e\u003c/pre\u003e\n\n\u003cp\u003eIn a separate terminal, we send a curl request to our API:\u003c/p\u003e\n\n\u003cpre\u003e\u003ccode class=\"sh\"\u003ecurl -X POST \\\n  --header \u0026quot;Content-type:application/json\u0026quot; \\\n  --header \u0026quot;Authorization: Bearer thisismysecretAPIkey\u0026quot; \\\n  --data \u0026quot;{\\\u0026quot;message\\\u0026quot;:\\\u0026quot;Please find names and hire dates of any employees in the marketing department hired after 2018. Sort them by hire date.\\\u0026quot;}\u0026quot; \\\n  http://localhost:3000/search\n\nI found the names and hire dates of employees in the marketing department who were hired after 2018. The data is sorted by hire date in ascending order. 
Here are the results:\n\n- Jailyn McClure, hired on 2019-02-21\n- Leopold Johnston, hired on 2019-02-21\n- Francis Kris, hired on 2019-10-09\n- Jerad Strosin, hired on 2019-10-22\n- Daniela Boehm, hired on 2020-05-25\n- Joe Torp, hired on 2020-05-31\n- Harry Heaney, hired on 2020-08-16\n- Anabel Sporer, hired on 2020-12-22\n- Carson Gislason, hired on 2020-12-25\n- Bud Farrell, hired on 2021-05-04\n- Katelynn Swaniawski, hired on 2021-07-13\n- Ernesto Baumbach, hired on 2021-08-15\n- Gwendolyn DuBuque, hired on 2021-10-10\n- Willow Green, hired on 2021-11-20\n- Rodrigo Fay, hired on 2022-07-04\n- Makayla Crooks, hired on 2022-08-02\n- Gerry Boehm, hired on 2022-09-28\n- Gretchen Mertz, hired on 2023-02-15\n- Chloe Bayer, hired on 2023-03-30\n- Alek Herman, hired on 2023-05-25\n- Eloy Flatley, hired on 2023-08-25\n- Zackery Welch, hired on 2023-09-08\n\u003c/code\u003e\u003c/pre\u003e\n\n\u003cp\u003eOur API works as expected! It interpreted our request, queried the database successfully, and then returned results in a human-readable format.\u003c/p\u003e\n\n\u003cp\u003eNow it’s time to create our custom GPT.\u003c/p\u003e\n\u003ch2 class='anchored'\u003e\n  \u003ca name='deploy-to-heroku' href='#deploy-to-heroku'\u003eDeploy to Heroku\u003c/a\u003e\n\u003c/h2\u003e\n\n\u003cp\u003eFirst, we need to deploy our API application to Heroku.\u003c/p\u003e\n\u003ch3 class='anchored'\u003e\n  \u003ca name='step-1-create-a-new-heroku-app' href='#step-1-create-a-new-heroku-app'\u003eStep 1: Create a new Heroku app\u003c/a\u003e\n\u003c/h3\u003e\n\n\u003cp\u003eAfter logging in to Heroku, go to the Heroku dashboard and click \u003cstrong\u003eCreate new app\u003c/strong\u003e.\u003c/p\u003e\n\n\u003cp\u003e\u003cimg src=\"https://heroku-blog-files.s3.amazonaws.com/posts/1711486129-image12.png\" alt=\"Create new app\"\u003e\u003c/p\u003e\n\n\u003cp\u003eProvide a name for your app. 
Then, click \u003cstrong\u003eCreate app\u003c/strong\u003e.\u003c/p\u003e\n\n\u003cp\u003e\u003cimg src=\"https://heroku-blog-files.s3.amazonaws.com/posts/1711486162-image21.png\" alt=\"Provide a name for the app\"\u003e\u003c/p\u003e\n\u003ch3 class='anchored'\u003e\n  \u003ca name='step-2-connect-your-heroku-app-to-your-project-repository' href='#step-2-connect-your-heroku-app-to-your-project-repository'\u003eStep 2: Connect your Heroku app to your project repository\u003c/a\u003e\n\u003c/h3\u003e\n\n\u003cp\u003eWith your Heroku app created, connect it to the GitHub repository for your project.\u003c/p\u003e\n\n\u003cp\u003e\u003cimg src=\"https://heroku-blog-files.s3.amazonaws.com/posts/1711486209-image8.png\" alt=\"Connect to GitHub\"\u003e\u003c/p\u003e\n\u003ch3 class='anchored'\u003e\n  \u003ca name='step-3-add-heroku-postgres' href='#step-3-add-heroku-postgres'\u003eStep 3: Add Heroku Postgres\u003c/a\u003e\n\u003c/h3\u003e\n\n\u003cp\u003eYou’ll also need a PostgreSQL database running alongside your API. Go to your app’s \u003cstrong\u003eResources\u003c/strong\u003e page and search the add-ons for “postgres.”\u003c/p\u003e\n\n\u003cp\u003e\u003cimg src=\"https://heroku-blog-files.s3.amazonaws.com/posts/1711486237-image11.png\" alt=\"Add Heroku Postgres addon\"\u003e\u003c/p\u003e\n\n\u003cp\u003eSelect the “Mini” plan and submit the order form.\u003c/p\u003e\n\n\u003cp\u003e\u003cimg src=\"https://heroku-blog-files.s3.amazonaws.com/posts/1711486274-image19.png\" alt=\"Select Heroku Postgres plan\"\u003e\u003c/p\u003e\n\u003ch3 class='anchored'\u003e\n  \u003ca name='step-4-set-up-app-config-vars' href='#step-4-set-up-app-config-vars'\u003eStep 4: Set up app config vars\u003c/a\u003e\n\u003c/h3\u003e\n\n\u003cp\u003eYou’ll recall that our API depends on a few environment variables (in \u003ccode\u003e.env\u003c/code\u003e). 
When deploying to Heroku, you can set these up by going to your app \u003cstrong\u003eSettings\u003c/strong\u003e, \u003cstrong\u003eConfig Vars\u003c/strong\u003e. Add a new config var called \u003ccode\u003eOPENAI_API_KEY\u003c/code\u003e, and paste in the value you copied from OpenAI.\u003c/p\u003e\n\n\u003cp\u003eNotice that Heroku has added a \u003ccode\u003eDATABASE_URL\u003c/code\u003e config var based on your Heroku Postgres add-on. Convenient!\u003c/p\u003e\n\n\u003cp\u003eFinally, you need to add a config var called \u003ccode\u003eBEARER_AUTH_API_KEY\u003c/code\u003e. This is the key that any caller of our API (including ChatGPT, through our custom GPT’s action) will need to provide for authentication. You can set this to any value you want. We used an online random password generator to generate a string.\u003c/p\u003e\n\n\u003cp\u003e\u003cimg src=\"https://heroku-blog-files.s3.amazonaws.com/posts/1711486318-image9.png\" alt=\"Configuring environment variables\"\u003e\u003c/p\u003e\n\u003ch3 class='anchored'\u003e\n  \u003ca name='step-5-seed-the-database' href='#step-5-seed-the-database'\u003eStep 5: Seed the database\u003c/a\u003e\n\u003c/h3\u003e\n\n\u003cp\u003eDon’t forget to seed your newly running Heroku Postgres database with the dummy data. Assuming you have the \u003ca href=\"https://devcenter.heroku.com/articles/heroku-cli\"\u003eHeroku CLI\u003c/a\u003e installed, accessing your database add-on is incredibly convenient. Set up your database with the following:\u003c/p\u003e\n\n\u003cpre\u003e\u003ccode\u003eheroku pg:psql \u0026lt; create_schema.sql\nheroku pg:psql \u0026lt; create_records.sql\n\u003c/code\u003e\u003c/pre\u003e\n\u003ch3 class='anchored'\u003e\n  \u003ca name='step-6-deploy' href='#step-6-deploy'\u003eStep 6: Deploy\u003c/a\u003e\n\u003c/h3\u003e\n\n\u003cp\u003eGo to the \u003cstrong\u003eDeploy\u003c/strong\u003e tab for your Heroku app. Click \u003cstrong\u003eDeploy Branch\u003c/strong\u003e. 
Heroku takes the latest commit on the main branch, installs dependencies, and then starts the server (\u003ccode\u003eyarn start\u003c/code\u003e). You can deploy your API in seconds with just one click.\u003c/p\u003e\n\n\u003cp\u003e\u003cimg src=\"https://heroku-blog-files.s3.amazonaws.com/posts/1711486365-image16.png\" alt=\"Deploy from the dashboard\"\u003e\u003c/p\u003e\n\n\u003cp\u003eAfter you’ve deployed your application, click \u003cstrong\u003eOpen app\u003c/strong\u003e.\u003c/p\u003e\n\n\u003cp\u003e\u003cimg src=\"https://heroku-blog-files.s3.amazonaws.com/posts/1711486415-image5.png\" alt=\"Open app\"\u003e\u003c/p\u003e\n\n\u003cp\u003eThe app’s default page shows the Swagger UI with the API specification for our app. This page comes from the \u003ccode\u003e\u003ca href=\"https://www.npmjs.com/package/swagger-ui-express\"\u003eswagger-ui-express\u003c/a\u003e\u003c/code\u003e package.\u003c/p\u003e\n\n\u003cp\u003e\u003cimg src=\"https://heroku-blog-files.s3.amazonaws.com/posts/1711486541-image17.png\" alt=\"Swagger UI\"\u003e\u003c/p\u003e\n\u003ch2 class='anchored'\u003e\n  \u003ca name='create-and-configure-gpt' href='#create-and-configure-gpt'\u003eCreate and Configure GPT\u003c/a\u003e\n\u003c/h2\u003e\n\n\u003cp\u003eCreating a GPT is quick and easy. When you’re logged in to \u003ca href=\"https://chat.openai.com/\"\u003ehttps://chat.openai.com/\u003c/a\u003e, click \u003cstrong\u003eExplore GPTs\u003c/strong\u003e in the left-hand navigation. Then, click the \u003cstrong\u003e+ Create\u003c/strong\u003e button.\u003c/p\u003e\n\u003ch3 class='anchored'\u003e\n  \u003ca name='configure-the-initial-settings' href='#configure-the-initial-settings'\u003eConfigure the initial settings\u003c/a\u003e\n\u003c/h3\u003e\n\n\u003cp\u003eThere are two tabs you can navigate when creating a GPT. 
The \u003cstrong\u003eCreate\u003c/strong\u003e tab is a wizard-style interface where you interact with the GPT Builder to solidify what you want your GPT to do. Since we already know what we want to do, we will configure our GPT directly. Click the \u003cstrong\u003eConfigure\u003c/strong\u003e tab.\u003c/p\u003e\n\n\u003cp\u003e\u003cimg src=\"https://heroku-blog-files.s3.amazonaws.com/posts/1711486575-image22.png\" alt=\"Configure GPT\"\u003e\u003c/p\u003e\n\n\u003cp\u003eWe provide a name, description, and basic instructions for our GPT. We also upload the logo for our GPT. The codebase has a logo you can use: \u003ccode\u003eresources/logo.png\u003c/code\u003e.\u003c/p\u003e\n\n\u003cp\u003e\u003cimg src=\"https://heroku-blog-files.s3.amazonaws.com/posts/1711486620-image2.png\" alt=\"New GPT\"\u003e\u003c/p\u003e\n\n\u003cp\u003eFor “Capabilities”, we can uncheck all of the options, as our GPT will not need to use them.\u003c/p\u003e\n\n\u003cp\u003e\u003cimg src=\"https://heroku-blog-files.s3.amazonaws.com/posts/1711486642-image1.png\" alt=\"GPT Capabilities\"\u003e\u003c/p\u003e\n\u003ch3 class='anchored'\u003e\n  \u003ca name='create-new-action' href='#create-new-action'\u003eCreate new action\u003c/a\u003e\n\u003c/h3\u003e\n\n\u003cp\u003eThe “meat” of our GPT will be an action that calls our Heroku-deployed API. At the bottom of the Configure page, we click \u003cstrong\u003eCreate new action\u003c/strong\u003e.\u003c/p\u003e\n\n\u003cp\u003e\u003cimg src=\"https://heroku-blog-files.s3.amazonaws.com/posts/1711486673-image13.png\" alt=\"Create new action\"\u003e\u003c/p\u003e\n\n\u003cp\u003eTo configure our GPT’s action, we need to specify the API authentication scheme and provide the OpenAPI schema for our API. With this information, our GPT will have what it needs to call our API properly.\u003c/p\u003e\n\n\u003cp\u003eFor authentication, we select \u003cstrong\u003eAPI Key\u003c/strong\u003e as the authentication type. 
Then, we enter the value we set in our Heroku config vars for \u003ccode\u003eBEARER_AUTH_API_KEY\u003c/code\u003e. Our auth type is \u003cstrong\u003eBearer\u003c/strong\u003e.\u003c/p\u003e\n\n\u003cp\u003e\u003cimg src=\"https://heroku-blog-files.s3.amazonaws.com/posts/1711486706-image18.png\" alt=\"Authentication configuration\"\u003e\u003c/p\u003e\n\n\u003cp\u003eFor the schema, we need to import or paste in the OpenAPI specification for our API. This specification lets ChatGPT know what endpoints are available and how to interact with our API. Fortunately, because we use \u003ccode\u003eswagger-ui-express\u003c/code\u003e, we have access to a dynamically generated OpenAPI spec simply by visiting the \u003ccode\u003e/api-docs/openapi.yaml\u003c/code\u003e route in our Heroku app.\u003c/p\u003e\n\n\u003cp\u003eWe click \u003cstrong\u003eImport from URL\u003c/strong\u003e and paste in the URL for our Heroku app serving up the OpenAPI spec (for example, \u003ccode\u003ehttps://my-gpt-12345.herokuapp.com/api-docs/openapi.yaml\u003c/code\u003e). Then, we click \u003cstrong\u003eImport\u003c/strong\u003e. 
This loads in the schema.\u003c/p\u003e\n\n\u003cp\u003e\u003cimg src=\"https://heroku-blog-files.s3.amazonaws.com/posts/1711486743-image3.png\" alt=\"OpenAPI schema\"\u003e\u003c/p\u003e\n\n\u003cp\u003eWith the configurations for action set, we click \u003cstrong\u003eSave\u003c/strong\u003e (Publish to \u003cstrong\u003eOnly me\u003c/strong\u003e).\u003c/p\u003e\n\n\u003cp\u003e\u003cimg src=\"https://heroku-blog-files.s3.amazonaws.com/posts/1711486767-image15.png\" alt=\"Publish options\"\u003e\u003c/p\u003e\n\n\u003cp\u003eNow, we can test out some interactions with our GPT.\u003c/p\u003e\n\n\u003cp\u003e\u003cimg src=\"https://heroku-blog-files.s3.amazonaws.com/posts/1711486800-image14.png\" alt=\"Using the GPT\"\u003e\u003c/p\u003e\n\n\u003cp\u003e\u003cimg src=\"https://heroku-blog-files.s3.amazonaws.com/posts/1711486835-image20.png\" alt=\"Using the GPT example\"\u003e\u003c/p\u003e\n\n\u003cp\u003eEverything is connected and working! If you’ve been following by performing all these steps along the way, then congratulations on building your first GPT!\u003c/p\u003e\n\u003ch2 class='anchored'\u003e\n  \u003ca name='conclusion' href='#conclusion'\u003eConclusion\u003c/a\u003e\n\u003c/h2\u003e\n\n\u003cp\u003eExperience in building and deploying custom GPTs sets you up to enhance the ChatGPT experience of businesses and individuals who are adopting it en masse. The majority of the work in building a GPT with an action is in implementing the API. After this, you only need to make a few setup configurations, and you’re good to go.\u003c/p\u003e\n\n\u003cp\u003eDeploying your API to Heroku—along with any add-ons you might need, like a database or a key-value store—is quick, simple, and low cost. 
When you’re ready to get started, \u003ca href=\"https://signup.heroku.com/\"\u003esign up\u003c/a\u003e for a Heroku account and begin building today!\u003c/p\u003e\n","published_at":"2024-03-28T16:00:00.000Z","permalink":"https://blog.heroku.com/gpt-backed-heroku-api","tags":["openai","gpt","nodejs","AI"],"summary":"How to connect your GPT on OpenAI to a backend Node.js app"},{"title":"Working with ChatGPT Functions on Heroku","content":"\u003ch2 class='anchored'\u003e\n  \u003ca name='how-to-build-and-deploy-a-node-js-app-that-uses-openai-s-apis' href='#how-to-build-and-deploy-a-node-js-app-that-uses-openai-s-apis'\u003eHow to Build and Deploy a Node.js App That Uses OpenAI’s APIs\u003c/a\u003e\n\u003c/h2\u003e\n\n\u003cp\u003eNear the end of 2023, ChatGPT \u003ca href=\"https://www.linkedin.com/news/story/chatgpt-hits-100m-weekly-users-5808204/\"\u003eannounced\u003c/a\u003e that it had 100M weekly users. That’s a massive base of users who want to take advantage of the convenience and power of intelligent question answering with natural language.\u003c/p\u003e\n\n\u003cp\u003e\u003cimg src=\"https://heroku-blog-files.s3.amazonaws.com/posts/1706564004-chatgpt.png\" alt=\"ChatGPT Interface\"\u003e\u003c/p\u003e\n\n\u003cp\u003eWith this level of popularity for ChatGPT, it’s no wonder that software developers are joining the ChatGPT app gold rush, building tools on top of OpenAI’s APIs. Building and deploying a GenAI-based app is quite easy to do—and we’re going to show you how!\u003c/p\u003e\n\n\u003c!-- more --\u003e\n\n\u003cp\u003eIn this post, we walk through how to build a Node.js application that works with OpenAI’s \u003ca href=\"https://platform.openai.com/docs/guides/text-generation/chat-completions-api\"\u003eChat Completions API\u003c/a\u003e and uses its \u003ca href=\"https://platform.openai.com/docs/guides/function-calling\"\u003efunction calling\u003c/a\u003e feature. We deploy it all to Heroku for quick, secure, and simple hosting. 
And we’ll have some fun along the way. This project is part of our new \u003ca href=\"https://github.com/heroku-reference-apps\"\u003eHeroku Reference Applications\u003c/a\u003e, a GitHub organization where we host different projects showcasing architectures to deploy to Heroku.\u003c/p\u003e\n\n\u003cp\u003eReady? Let’s go!\u003c/p\u003e\n\u003ch2 class='anchored'\u003e\n  \u003ca name='meet-the-menu-maker' href='#meet-the-menu-maker'\u003eMeet the Menu Maker\u003c/a\u003e\n\u003c/h2\u003e\n\n\u003cp\u003eOur web application is called Menu Maker. What does it do? Menu Maker lets users enter a list of ingredients they have on hand. Menu Maker then comes up with a dish using those ingredients. It provides a description of the dish as you’d find it on a fine dining menu, along with a full ingredients list and recipe instructions.\u003c/p\u003e\n\n\u003cp\u003eThis basic example of generative AI combines the user-supplied ingredients, additional instructional prompts, and some structured constraints via ChatGPT\u0026#39;s function calling to create new content. The application’s code provides the user experience and the data flow.\u003c/p\u003e\n\n\u003cp\u003eMenu Maker is a Node.js application with a React front-end UI that talks to an Express back-end API server. The Node.js application is a monorepo, containing both front-end and back-end code, hosted on GitHub. 
The entire application is deployed on Heroku.\u003c/p\u003e\n\n\u003cp\u003eHere’s a preview of Menu Maker in action:\u003c/p\u003e\n\n\u003cp\u003e\u003cimg src=\"https://heroku-blog-files.s3.amazonaws.com/posts/1706564097-menumaker.gif\" alt=\"Menu Maker in action\"\u003e\u003c/p\u003e\n\n\u003cp\u003eLet’s briefly break down the application flow:\u003c/p\u003e\n\n\u003col\u003e\n\u003cli\u003eThe back-end server takes the user’s form submission, supplements it with additional information, and then sends a request to OpenAI’s Chat Completions API.\u003c/li\u003e\n\u003cli\u003eThe back-end server receives the response from OpenAI and passes it up to the front-end.\u003c/li\u003e\n\u003cli\u003eThe front-end updates the interface to reflect the response received from OpenAI.\u003c/li\u003e\n\u003c/ol\u003e\n\n\u003cp\u003e\u003cimg src=\"https://heroku-blog-files.s3.amazonaws.com/posts/1706563894-Heroku%20AI%20Ref%20App%20%231%20-%20Heroku%20Reference%20App%201%20-%20Architecture.jpg\" alt=\"Architecture Diagram\"\u003e\u003c/p\u003e\n\u003ch2 class='anchored'\u003e\n  \u003ca name='prerequisites' href='#prerequisites'\u003ePrerequisites\u003c/a\u003e\n\u003c/h2\u003e\n\n\u003cp\u003e\u003cstrong\u003eNote\u003c/strong\u003e: If you want to try the application first, deploy it using the “Deploy to Heroku” button in the reference application’s \u003ca href=\"https://github.com/heroku-reference-apps/menumaker/blob/main/README.md\"\u003eREADME\u003c/a\u003e file.\u003c/p\u003e\n\n\u003cp\u003eBefore we dive into the code let’s cover the prerequisites. Here’s what you need to get started:\u003c/p\u003e\n\n\u003col\u003e\n\u003cli\u003eAn \u003ca href=\"https://openai.com/\"\u003eOpenAI account\u003c/a\u003e. You must add a payment method and purchase a small amount of credit to access its APIs. As we built and tested our application, the total cost of all the API calls made was less than $1*. 
\u003c/li\u003e\n\u003cli\u003eAfter setting up your OpenAI account, create a \u003ca href=\"https://platform.openai.com/api-keys\"\u003esecret API key\u003c/a\u003e and copy it down. Your application back-end needs this key to authenticate its requests to the OpenAI API.\u003c/li\u003e\n\u003cli\u003eA \u003ca href=\"https://signup.heroku.com/\"\u003eHeroku account\u003c/a\u003e. You must add a payment method to cover your compute costs. For building and testing this application, we recommend using an \u003ca href=\"https://devcenter.heroku.com/articles/eco-dyno-hours\"\u003eEco dyno\u003c/a\u003e, which has a $5 monthly flat fee and provides more than enough hours for your initial app.\u003c/li\u003e\n\u003cli\u003eA \u003ca href=\"https://github.com/\"\u003eGitHub account\u003c/a\u003e for your code repository. Heroku hooks into your GitHub repo directly, simplifying deployment to a single click.\u003c/li\u003e\n\u003c/ol\u003e\n\n\u003cp\u003e\u003cstrong\u003eNote\u003c/strong\u003e: Every menu recipe request incurs costs and the price varies depending on the selected model. For example, using the GPT-3 model, in order to spend $1, you\u0026#39;d have to request more than 30,000 recipes. See the \u003ca href=\"https://openai.com/pricing\"\u003eOpenAI API pricing\u003c/a\u003e page for more information.\u003c/p\u003e\n\u003ch2 class='anchored'\u003e\n  \u003ca name='initial-steps' href='#initial-steps'\u003eInitial Steps\u003c/a\u003e\n\u003c/h2\u003e\n\n\u003cp\u003eFor our environment, we use Node \u003ccode\u003ev20.10.0\u003c/code\u003e and \u003ccode\u003eyarn\u003c/code\u003e as our package manager. Start by cloning the \u003ca href=\"https://github.com/heroku-reference-apps/menumaker\"\u003ecodebase available in our Heroku Reference Applications GitHub organization\u003c/a\u003e. 
Then, install your dependencies by running:\u003c/p\u003e\n\n\u003cpre\u003e\u003ccode\u003eyarn install\n\u003c/code\u003e\u003c/pre\u003e\n\u003ch2 class='anchored'\u003e\n  \u003ca name='build-the-back-end' href='#build-the-back-end'\u003eBuild the Back-End\u003c/a\u003e\n\u003c/h2\u003e\n\n\u003cp\u003eOur back-end API server uses \u003ca href=\"https://expressjs.com/\"\u003eExpress\u003c/a\u003e and listens for POST requests to the \u003ccode\u003e/ingredients\u003c/code\u003e endpoint. We supplement those ingredients with more precise prompt instructions, sending a subsequent request to OpenAI.\u003c/p\u003e\n\u003ch3 class='anchored'\u003e\n  \u003ca name='working-with-openai' href='#working-with-openai'\u003eWorking with OpenAI\u003c/a\u003e\n\u003c/h3\u003e\n\n\u003cp\u003eAlthough OpenAI’s API supports advanced usage like image generation or speech-to-text, the simplest use case is to work with \u003ca href=\"https://platform.openai.com/docs/guides/text-generation\"\u003etext generation\u003c/a\u003e. You send a set of messages to let OpenAI know what you’re seeking, and what kind of behavior you expect as it responds to you.\u003c/p\u003e\n\n\u003cp\u003eTypically, the first message is a \u003ccode\u003esystem\u003c/code\u003e message, where you specify the desired behavior of ChatGPT. Eventually, you end up with a string of messages, a conversation, between the \u003ccode\u003euser\u003c/code\u003e (you) and the \u003ccode\u003eassistant\u003c/code\u003e (ChatGPT).\u003c/p\u003e\n\u003ch3 class='anchored'\u003e\n  \u003ca name='call-functions-with-openai' href='#call-functions-with-openai'\u003eCall Functions with OpenAI\u003c/a\u003e\n\u003c/h3\u003e\n\n\u003cp\u003eMost users are familiar with the chatbot-style conversation format of ChatGPT. However, developers want structured data, like a JSON object, in their ChatGPT responses. 
JSON makes it easier to work with responses programmatically.\u003c/p\u003e\n\n\u003cp\u003eFor example, imagine asking ChatGPT for a list of events in the 2020 Summer Olympics. As a programmer, you want to process the response by inserting each Olympic event into a database. You also want to send follow-up API requests for each event returned. In this case, you don’t want several paragraphs of ChatGPT describing Olympic events in prose. You’d rather have a JSON object with an array of event names.\u003c/p\u003e\n\n\u003cp\u003eUse cases like these are where ChatGPT \u003cem\u003e\u003ca href=\"https://platform.openai.com/docs/guides/function-calling\"\u003efunctions\u003c/a\u003e\u003c/em\u003e come in handy. Alongside the set of \u003ccode\u003emessages\u003c/code\u003e you send to OpenAI, you send \u003ccode\u003efunctions\u003c/code\u003e, which detail how you use the response from OpenAI. You can specify the name of a function to call, along with data types and descriptions of all the parameters to pass to that function.\u003c/p\u003e\n\n\u003cp\u003e\u003cstrong\u003eNote:\u003c/strong\u003e ChatGPT \u003cem\u003edoesn’t\u003c/em\u003e call functions as part of its response. Instead, it provides a formatted response that you can easily feed directly into a custom function in your code.\u003c/p\u003e\n\u003ch3 class='anchored'\u003e\n  \u003ca name='initialize-prompt-settings-with-function-information' href='#initialize-prompt-settings-with-function-information'\u003eInitialize Prompt Settings with Function Information\u003c/a\u003e\n\u003c/h3\u003e\n\n\u003cp\u003eLet’s take a look at \u003ccode\u003esrc/server/ai.js\u003c/code\u003e. In our code, we send a \u003ccode\u003esettings\u003c/code\u003e object to the Chat Completions API. 
The \u003ccode\u003esettings\u003c/code\u003e object starts with the following:\u003c/p\u003e\n\n\u003cpre\u003e\u003ccode class=\"language-javascript\"\u003econst settings = {\n  functions: [\n    {\n    name: \u0026#39;updateDish\u0026#39;,\n    description: \u0026#39;Generate a fine dining dish based on a list of ingredients\u0026#39;,\n    parameters: {\n        type: \u0026#39;object\u0026#39;,\n        properties: {\n        title: {\n            type: \u0026#39;string\u0026#39;,\n            description: \u0026#39;Name of the dish, as it would appear on a fine dining menu\u0026#39;\n        },\n        description: {\n            type: \u0026#39;string\u0026#39;,\n            description: \u0026#39;Description of the dish, in 2-3 sentences, as it would appear on a fine dining menu\u0026#39;\n        },\n        ingredients: {\n            type: \u0026#39;array\u0026#39;,\n            description: \u0026#39;List of all ingredients--both provided and additional ones in the dish you have conceived--capitalized, along with measurements, that would be needed to make 8 servings of this dish\u0026#39;,\n            items: {\n            type: \u0026#39;object\u0026#39;,\n            properties: {\n                ingredient: {\n                type: \u0026#39;string\u0026#39;,\n                description: \u0026#39;Name of ingredient\u0026#39;\n                },\n                amount: {\n                type: \u0026#39;string\u0026#39;,\n                description: \u0026#39;Amount of ingredient needed for recipe\u0026#39;\n                }\n            }\n            }\n        },\n        recipe: {\n            type: \u0026#39;array\u0026#39;,\n            description: \u0026#39;Ordered list of recipe steps, numbered as \u0026quot;1.\u0026quot;, \u0026quot;2.\u0026quot;, etc., needed to make this dish\u0026#39;,\n            items: {\n            type: \u0026#39;string\u0026#39;,\n            description: \u0026#39;Recipe step\u0026#39;\n            }\n     
   }\n        },\n        required: [\u0026#39;title\u0026#39;, \u0026#39;description\u0026#39;, \u0026#39;ingredients\u0026#39;, \u0026#39;recipe\u0026#39;]\n    }\n    }\n  ],\n  model: CHATGPT_MODEL,\n  function_call: \u0026#39;auto\u0026#39;\n}\n\u003c/code\u003e\u003c/pre\u003e\n\n\u003cp\u003eWe’re telling OpenAI that we plan to use its response in a function that we call \u003ccode\u003eupdateDish\u003c/code\u003e, a function in our React front-end code. When calling \u003ccode\u003eupdateDish\u003c/code\u003e, we must pass in an object with four parameters:\u003c/p\u003e\n\n\u003col\u003e\n\u003cli\u003e\u003ccode\u003etitle\u003c/code\u003e: the name of our dish\u003c/li\u003e\n\u003cli\u003e\u003ccode\u003edescription\u003c/code\u003e: a description of our dish\u003c/li\u003e\n\u003cli\u003e\u003ccode\u003eingredients\u003c/code\u003e: an array of objects, each having an \u003ccode\u003eingredient\u003c/code\u003e name and \u003ccode\u003eamount\u003c/code\u003e\u003c/li\u003e\n\u003cli\u003e\u003ccode\u003erecipe\u003c/code\u003e: an array of recipe steps for making the dish\u003c/li\u003e\n\u003c/ol\u003e\n\u003ch3 class='anchored'\u003e\n  \u003ca name='send-settings-with-ingredients-attached' href='#send-settings-with-ingredients-attached'\u003eSend Settings with Ingredients Attached\u003c/a\u003e\n\u003c/h3\u003e\n\n\u003cp\u003eIn addition to the \u003ccode\u003efunctions\u003c/code\u003e specification, we must attach \u003ccode\u003emessages\u003c/code\u003e in our request \u003ccode\u003esettings\u003c/code\u003e, to clearly tell ChatGPT what we want it to do. Our module’s \u003ccode\u003esend\u003c/code\u003e function looks like:\u003c/p\u003e\n\n\u003cpre\u003e\u003ccode class=\"language-javascript\"\u003econst PROMPT = \u0026#39;I am writing descriptions of dishes for a menu. I am going to provide you with a list of ingredients. 
Based on that list, please come up with a dish that can be created with those ingredients.\u0026#39;\n\nconst send = async (ingredients) =\u0026gt; {\n  const openai = new OpenAI({\n    timeout: 10000,\n    maxRetries: 3\n  })\n  settings.messages = [\n    {\n      role: \u0026#39;system\u0026#39;,\n      content: PROMPT\n    }, {\n      role: \u0026#39;user\u0026#39;,\n      content: `The ingredients that will contribute to my dish are: ${ingredients}.`\n    }\n  ]\n  const completion = await openai.chat.completions.create(settings)\n  return completion.choices[0].message\n}\n\u003c/code\u003e\u003c/pre\u003e\n\n\u003cp\u003eOur Node.js application imports the \u003ccode\u003e\u003ca href=\"https://www.npmjs.com/package/openai\"\u003eopenai\u003c/a\u003e\u003c/code\u003e package (not shown), which serves as a handy JavaScript library for OpenAI. It abstracts away the details of sending HTTP requests to the OpenAI API.\u003c/p\u003e\n\n\u003cp\u003eWe start with a \u003ccode\u003esystem\u003c/code\u003e message that tells ChatGPT what the basic task is and the behavior we expect. Then, we add a \u003ccode\u003euser\u003c/code\u003e message that includes the ingredients, which gets passed as an argument to the \u003ccode\u003esend\u003c/code\u003e function. We send these \u003ccode\u003esettings\u003c/code\u003e to the API, asking it to \u003ccode\u003e\u003ca href=\"https://platform.openai.com/docs/api-reference/chat/create\"\u003ecreate\u003c/a\u003e\u003c/code\u003e a model response. Then, we return the response \u003ccode\u003emessage\u003c/code\u003e.\u003c/p\u003e\n\u003ch3 class='anchored'\u003e\n  \u003ca name='handle-the-post-request' href='#handle-the-post-request'\u003eHandle the POST Request\u003c/a\u003e\n\u003c/h3\u003e\n\n\u003cp\u003eIn \u003ccode\u003esrc/server/index.js\u003c/code\u003e, we set up our Express server and handle POST requests to \u003ccode\u003e/ingredients\u003c/code\u003e. 
Our code looks like:\u003c/p\u003e\n\n\u003cpre\u003e\u003ccode class=\"language-javascript\"\u003eimport express from \u0026#39;express\u0026#39;\nimport AI from \u0026#39;./ai.js\u0026#39;\n\nconst server = express()\nserver.use(express.json())\n\nserver.post(\u0026#39;/ingredients\u0026#39;, async (req, res) =\u0026gt; {\n  if (process.env.NODE_ENV !== \u0026#39;test\u0026#39;) {\n    console.log(`Request to /ingredients received: ${req.body.message}`)\n  }\n  if ((typeof req.body.message) === \u0026#39;undefined\u0026#39; || !req.body.message.length) {\n    res.status(400).json({ error: \u0026#39;No ingredients provided in \u0026quot;message\u0026quot; key of payload.\u0026#39; })\n    return\n  }\n  try {\n    const completionResponse = await AI.send(req.body.message)\n    res.json(completionResponse.function_call)\n  } catch (error) {\n    res.status(500).json({ error: error.message })\n  }\n})\n\nexport default server\n\u003c/code\u003e\u003c/pre\u003e\n\n\u003cp\u003eAfter removing the error handling and log messages, the most important lines of code are:\u003c/p\u003e\n\n\u003cpre\u003e\u003ccode class=\"language-javascript\"\u003econst completionResponse = await AI.send(req.body.message)\nres.json(completionResponse.function_call)\n\u003c/code\u003e\u003c/pre\u003e\n\n\u003cp\u003eOur server passes the request payload \u003ccode\u003emessage\u003c/code\u003e contents to our module’s \u003ccode\u003esend\u003c/code\u003e method. The response, from OpenAI, and then from our module, is an object that includes a \u003ccode\u003efunction_call\u003c/code\u003e subobject. 
\u003ccode\u003efunction_call\u003c/code\u003e has a \u003ccode\u003ename\u003c/code\u003e and \u003ccode\u003earguments\u003c/code\u003e, which we use in our custom \u003ccode\u003eupdateDish\u003c/code\u003e function.\u003c/p\u003e\n\u003ch3 class='anchored'\u003e\n  \u003ca name='testing-the-back-end' href='#testing-the-back-end'\u003eTesting the Back-End\u003c/a\u003e\n\u003c/h3\u003e\n\n\u003cp\u003eWe’re ready to test our back-end!\u003c/p\u003e\n\n\u003cp\u003eThe \u003ccode\u003eopenai\u003c/code\u003e JavaScript package expects an environment variable called \u003ccode\u003eOPENAI_API_KEY\u003c/code\u003e. We set up our server \u003ca href=\"https://devcenter.heroku.com/articles/heroku-local#run-your-app-locally-with-the-heroku-local-command-line-tool-start-your-app-locally\"\u003eto listen on port 3000\u003c/a\u003e, and then we start it:\u003c/p\u003e\n\n\u003cpre\u003e\u003ccode\u003eOPENAI_API_KEY=sk-Kie*** node index.js\nServer is running on port 3000\n\u003c/code\u003e\u003c/pre\u003e\n\n\u003cp\u003eIn a separate terminal, we send a request with curl:\u003c/p\u003e\n\n\u003cpre\u003e\u003ccode\u003ecurl -X POST \\\n  --header \u0026quot;Content-type:application/json\u0026quot; \\\n  --data \u0026quot;{\\\u0026quot;message\\\u0026quot;:\\\u0026quot;cauliflower, fresh rosemary, parmesan cheese\\\u0026quot;}\u0026quot; \\\n  http://localhost:3000/ingredients\n\n{\u0026quot;name\u0026quot;:\u0026quot;updateDish\u0026quot;,\u0026quot;arguments\u0026quot;:\u0026quot;{\\\u0026quot;title\\\u0026quot;:\\\u0026quot;Crispy Rosemary Parmesan Cauliflower\\\u0026quot;,\\\u0026quot;description\\\u0026quot;:\\\u0026quot;Tender cauliflower florets roasted to perfection with aromatic fresh rosemary and savory Parmesan cheese, creating a crispy and flavorful dish.\\\u0026quot;,\\\u0026quot;ingredients\\\u0026quot;:[{\\\u0026quot;ingredient\\\u0026quot;:\\\u0026quot;cauliflower\\\u0026quot;,\\\u0026quot;amount\\\u0026quot;:\\\u0026quot;1 large head, cut into 
florets\\\u0026quot;},{\\\u0026quot;ingredient\\\u0026quot;:\\\u0026quot;fresh rosemary\\\u0026quot;,\\\u0026quot;amount\\\u0026quot;:\\\u0026quot;2 tbsp, chopped\\\u0026quot;},{\\\u0026quot;ingredient\\\u0026quot;:\\\u0026quot;parmesan cheese\\\u0026quot;,\\\u0026quot;amount\\\u0026quot;:\\\u0026quot;1/2 cup, grated\\\u0026quot;},{\\\u0026quot;ingredient\\\u0026quot;:\\\u0026quot;olive oil\\\u0026quot;,\\\u0026quot;amount\\\u0026quot;:\\\u0026quot;3 tbsp\\\u0026quot;},{\\\u0026quot;ingredient\\\u0026quot;:\\\u0026quot;salt\\\u0026quot;,\\\u0026quot;amount\\\u0026quot;:\\\u0026quot;to taste\\\u0026quot;},{\\\u0026quot;ingredient\\\u0026quot;:\\\u0026quot;black pepper\\\u0026quot;,\\\u0026quot;amount\\\u0026quot;:\\\u0026quot;to taste\\\u0026quot;}],\\\u0026quot;recipe\\\u0026quot;:[\\\u0026quot;1. Preheat the oven to 425°F.\\\u0026quot;,\\\u0026quot;2. In a large bowl, toss the cauliflower florets with olive oil, chopped rosemary, salt, and black pepper.\\\u0026quot;,\\\u0026quot;3. Spread the cauliflower on a baking sheet and roast for 25-30 minutes, or until golden brown and crispy.\\\u0026quot;,\\\u0026quot;4. Sprinkle the roasted cauliflower with grated Parmesan cheese and return to the oven for 5 more minutes, until the cheese is melted and bubbly.\\\u0026quot;,\\\u0026quot;5. Serve hot and enjoy!\\\u0026quot;]}\u0026quot;}\n\u003c/code\u003e\u003c/pre\u003e\n\n\u003cp\u003eIt works! We have a JSON response with \u003ccode\u003earguments\u003c/code\u003e that our back-end can pass to the front-end’s \u003ccode\u003eupdateDish\u003c/code\u003e function.\u003c/p\u003e\n\n\u003cp\u003eLet’s briefly touch on what we did for the front-end UI.\u003c/p\u003e\n\u003ch2 class='anchored'\u003e\n  \u003ca name='build-the-front-end' href='#build-the-front-end'\u003eBuild the Front-End\u003c/a\u003e\n\u003c/h2\u003e\n\n\u003cp\u003eAll the OpenAI-related work happened in the back-end, so we won’t spend too much time unpacking the front-end. 
We built a basic React application that uses \u003ca href=\"https://mui.com/material-ui/getting-started/\"\u003eMaterial UI\u003c/a\u003e for styling. You can poke around in \u003ccode\u003esrc/client\u003c/code\u003e to see all the details for our front-end application.\u003c/p\u003e\n\n\u003cp\u003eIn \u003ccode\u003esrc/client/App.js\u003c/code\u003e, we see how our app handles the user’s web form submission:\u003c/p\u003e\n\n\u003cpre\u003e\u003ccode class=\"language-javascript\"\u003econst handleSubmit = async (inputValue) =\u0026gt; {\n  // Reject empty submissions before making a network request\n  if (inputValue.length === 0) {\n    setErrorMessage(\u0026#39;Please provide ingredients before submitting the form.\u0026#39;)\n    return\n  }\n  try {\n    setWaiting(true)\n    const response = await fetch(\u0026#39;/ingredients\u0026#39;, {\n      method: \u0026#39;POST\u0026#39;,\n      headers: {\n        \u0026#39;Content-Type\u0026#39;: \u0026#39;application/json\u0026#39;\n      },\n      body: JSON.stringify({ message: inputValue })\n    })\n    const data = await response.json()\n    if (!response.ok) {\n      setErrorMessage(data.error)\n      return\n    }\n\n    // `arguments` arrives as a JSON string, so parse it before use\n    updateDish(JSON.parse(data.arguments))\n  } catch (error) {\n    // Store the message string rather than the Error object\n    setErrorMessage(error.message)\n  } finally {\n    // Clear the waiting state on both success and failure\n    setWaiting(false)\n  }\n}\n\u003c/code\u003e\u003c/pre\u003e\n\n\u003cp\u003eWhen a user submits the form, the application sends a POST request to \u003ccode\u003e/ingredients\u003c/code\u003e. The \u003ccode\u003earguments\u003c/code\u003e value in the response is a JSON string; we parse it and pass the result directly to our \u003ccode\u003eupdateDish\u003c/code\u003e function. 
Using ChatGPT’s function calling feature significantly simplifies the steps to handle the response programmatically.\u003c/p\u003e\n\n\u003cp\u003eOur \u003ccode\u003eupdateDish\u003c/code\u003e function looks like:\u003c/p\u003e\n\n\u003cpre\u003e\u003ccode class=\"language-javascript\"\u003econst [title, setTitle] = useState(\u0026#39;\u0026#39;)\nconst [waiting, setWaiting] = useState(false)\nconst [description, setDescription] = useState(\u0026#39;\u0026#39;)\nconst [recipeSteps, setRecipeSteps] = useState([])\nconst [ingredients, setIngredients] = useState([])\nconst [errorMessage, setErrorMessage] = useState(\u0026#39;\u0026#39;)\nconst updateDish = ({ title, description, recipe, ingredients }) =\u0026gt; {\n  setTitle(title)\n  setDescription(description)\n  setRecipeSteps(recipe)\n  setIngredients(ingredients)\n  setWaiting(false)\n  setErrorMessage(\u0026#39;\u0026#39;)\n}\n\u003c/code\u003e\u003c/pre\u003e\n\n\u003cp\u003eYes, that’s it. We work with \u003ca href=\"https://react.dev/learn/state-a-components-memory\"\u003eReact states\u003c/a\u003e to keep track of our dish title, description, ingredients, and recipe. When \u003ccode\u003eupdateDish\u003c/code\u003e updates these values, all of our components update accordingly. \u003c/p\u003e\n\n\u003cp\u003eOur back-end and front-end pieces are all done. 
All that’s left to do is deploy.\u003c/p\u003e\n\n\u003cp\u003eNot shown in this walkthrough, but which you can find in the code repository, are:\u003c/p\u003e\n\n\u003cul\u003e\n\u003cli\u003eBasic unit tests for back-end and front-end components, using \u003ca href=\"https://jestjs.io/\"\u003eJest\u003c/a\u003e\u003c/li\u003e\n\u003cli\u003e\u003ca href=\"https://eslint.org/\"\u003eESLint\u003c/a\u003e and \u003ca href=\"https://prettier.io/\"\u003ePrettier\u003c/a\u003e configurations to keep our code clean and readable\u003c/li\u003e\n\u003cli\u003e\u003ca href=\"https://babeljs.io/\"\u003eBabel\u003c/a\u003e and \u003ca href=\"https://webpack.js.org/\"\u003eWebpack\u003c/a\u003e configurations for working with modules and packaging our front-end code for deployment\u003c/li\u003e\n\u003c/ul\u003e\n\u003ch2 class='anchored'\u003e\n  \u003ca name='deploy-to-heroku' href='#deploy-to-heroku'\u003eDeploy to Heroku\u003c/a\u003e\n\u003c/h2\u003e\n\n\u003cp\u003eWith our codebase committed to GitHub, we’re ready to deploy our entire application on Heroku. 
You can also use the \u003ca href=\"https://www.heroku.com/elements/buttons\"\u003eHeroku Button\u003c/a\u003e in the reference repository to simplify the deployment.\u003c/p\u003e\n\u003ch3 class='anchored'\u003e\n  \u003ca name='step-1-create-a-new-heroku-app' href='#step-1-create-a-new-heroku-app'\u003eStep 1: Create a New Heroku App\u003c/a\u003e\n\u003c/h3\u003e\n\n\u003cp\u003eAfter logging in to Heroku, click “Create new app” in the Heroku Dashboard.\u003c/p\u003e\n\n\u003cp\u003e\u003cimg src=\"https://heroku-blog-files.s3.amazonaws.com/posts/1706564301-create-new-app.png\" alt=\"Create a new Heroku app\"\u003e\u003c/p\u003e\n\n\u003cp\u003eNext, provide a name for your app and click “Create app”.\u003c/p\u003e\n\n\u003cp\u003e\u003cimg src=\"https://heroku-blog-files.s3.amazonaws.com/posts/1706564335-app-name.png\" alt=\"Application name\"\u003e\u003c/p\u003e\n\u003ch3 class='anchored'\u003e\n  \u003ca name='step-2-connect-your-repository' href='#step-2-connect-your-repository'\u003eStep 2: Connect Your Repository\u003c/a\u003e\n\u003c/h3\u003e\n\n\u003cp\u003eWith your Heroku app created, \u003ca href=\"https://devcenter.heroku.com/articles/github-integration#enabling-github-integration\"\u003econnect it to the GitHub repository\u003c/a\u003e for your project.\u003c/p\u003e\n\n\u003cp\u003e\u003cimg src=\"https://heroku-blog-files.s3.amazonaws.com/posts/1706564362-connect-github.png\" alt=\"Connect to GitHub\"\u003e\u003c/p\u003e\n\u003ch3 class='anchored'\u003e\n  \u003ca name='step-3-set-up-config-vars' href='#step-3-set-up-config-vars'\u003eStep 3: Set Up Config Vars\u003c/a\u003e\n\u003c/h3\u003e\n\n\u003cp\u003eRemember that your application back-end needs an OpenAI API key to authenticate requests. Navigate to your app “Settings”, then look for “Config Vars”. 
Add a new config var called \u003ccode\u003eOPENAI_API_KEY\u003c/code\u003e, and paste in the value for your key.\u003c/p\u003e\n\n\u003cp\u003eOptionally, you can also set a \u003ccode\u003eCHATGPT_MODEL\u003c/code\u003e config var, telling \u003ccode\u003esrc/server/ai.js\u003c/code\u003e which \u003ca href=\"https://platform.openai.com/docs/models/overview\"\u003eGPT model\u003c/a\u003e you want OpenAI to use. Models differ in capabilities, training data cutoff date, speed, and usage cost. If you don’t specify this config var, Menu Maker defaults to \u003ccode\u003egpt-3.5-turbo-1106\u003c/code\u003e.\u003c/p\u003e\n\n\u003cp\u003e\u003cimg src=\"https://heroku-blog-files.s3.amazonaws.com/posts/1706564394-config-vars.png\" alt=\"Setup config vars\"\u003e\u003c/p\u003e\n\u003ch3 class='anchored'\u003e\n  \u003ca name='step-4-deploy' href='#step-4-deploy'\u003eStep 4: Deploy\u003c/a\u003e\n\u003c/h3\u003e\n\n\u003cp\u003eGo to the “Deploy” tab for your Heroku app. Click “Deploy Branch”. Heroku takes the latest commit on the main branch, builds the application (\u003ccode\u003eyarn build\u003c/code\u003e), and then starts it up (\u003ccode\u003eyarn start\u003c/code\u003e). 
With just one click, you can deploy and update your application in under a minute.\u003c/p\u003e\n\n\u003cp\u003e\u003cimg src=\"https://heroku-blog-files.s3.amazonaws.com/posts/1706564420-deploy.png\" alt=\"Deploy the app\"\u003e\u003c/p\u003e\n\u003ch3 class='anchored'\u003e\n  \u003ca name='step-5-open-your-app' href='#step-5-open-your-app'\u003eStep 5: Open Your App\u003c/a\u003e\n\u003c/h3\u003e\n\n\u003cp\u003eWith the app deployed, click “Open app” at the top of your Heroku app page to get redirected to the unique and secure URL for your app.\u003c/p\u003e\n\n\u003cp\u003e\u003cimg src=\"https://heroku-blog-files.s3.amazonaws.com/posts/1706564446-open-app.png\" alt=\"Open application\"\u003e\u003c/p\u003e\n\n\u003cp\u003eWith that, your shiny, new, ChatGPT-powered web application is up and running!\u003c/p\u003e\n\u003ch3 class='anchored'\u003e\n  \u003ca name='step-6-scale-down-your-app' href='#step-6-scale-down-your-app'\u003eStep 6: Scale Down Your App\u003c/a\u003e\n\u003c/h3\u003e\n\n\u003cp\u003eWhen you’re done using the app, remember to \u003ca href=\"https://devcenter.heroku.com/articles/scaling#manual-scaling\"\u003escale your dynos to zero\u003c/a\u003e to prevent incurring unwanted costs.\u003c/p\u003e\n\u003ch2 class='anchored'\u003e\n  \u003ca name='conclusion' href='#conclusion'\u003eConclusion\u003c/a\u003e\n\u003c/h2\u003e\n\n\u003cp\u003eWith all the recent hype surrounding generative AI, many developers are itching to build ChatGPT-powered applications. Working with OpenAI’s API can initially seem daunting, but it’s straightforward. In addition, OpenAI’s function calling feature simplifies your task by accommodating your structured data needs.\u003c/p\u003e\n\n\u003cp\u003eWhen it comes to quick and easy deployment, you can get up and running on Heroku within minutes, for just a few dollars a month. 
While the demonstration here works specifically with ChatGPT, it’s just as easy to deploy apps that use other foundation models, such as Google Bard, LLaMA from Meta, or other APIs.\u003c/p\u003e\n\n\u003cp\u003eAre you ready to take the plunge into building GenAI-based applications? Today is the day. Happy coding!\u003c/p\u003e\n","published_at":"2024-01-30T09:00:00.000Z","permalink":"https://blog.heroku.com/working-with-chatgpt-functions-on-heroku","tags":["react","node.js","chatgpt","AI"],"summary":"Learn to build and deploy a Node.js app that uses OpenAI’s APIs with function invocation"},{"title":"How to Use pgvector for Similarity Search on Heroku Postgres","content":"\u003ch2 class='anchored'\u003e\n  \u003ca name='introducing-pgvector-for-heroku-postgres' href='#introducing-pgvector-for-heroku-postgres'\u003eIntroducing pgvector for Heroku Postgres\u003c/a\u003e\n\u003c/h2\u003e\n\n\u003cp\u003eOver the past few weeks, we worked on adding \u003ca href=\"https://github.com/pgvector/pgvector\"\u003epgvector\u003c/a\u003e as an extension on Heroku Postgres. We\u0026#39;re excited to release this feature, and based on the feedback on \u003ca href=\"https://github.com/heroku/roadmap/issues/156\"\u003eour public roadmap\u003c/a\u003e, many of you are too. We want to share a bit more about how you can use it and how it may be helpful to you. \u003c/p\u003e\n\n\u003cp\u003eAll \u003ca href=\"https://devcenter.heroku.com/articles/heroku-postgres-plans#plan-tiers\"\u003eStandard-tier or higher\u003c/a\u003e databases running Postgres 15 now support the \u003ca href=\"https://devcenter.heroku.com/articles/heroku-postgres-extensions-postgis-full-text-search#pgvector\"\u003e\u003ccode\u003epgvector\u003c/code\u003e extension\u003c/a\u003e. You can get started by running \u003ccode\u003eCREATE EXTENSION vector;\u003c/code\u003e in a client session. Postgres 15 has been the default version on Heroku Postgres since March 2023.  
If you\u0026#39;re on an older version and want to use pgvector, \u003ca href=\"https://devcenter.heroku.com/articles/upgrading-heroku-postgres-databases\"\u003eupgrade\u003c/a\u003e to Postgres 15.\u003c/p\u003e\n\n\u003cp\u003eThe extension adds the vector data type to Heroku Postgres along with additional functions to work with it. Vectors are important for working with large language models and other machine learning applications, as the \u003ca href=\"https://huggingface.co/blog/getting-started-with-embeddings#understanding-embeddings\"\u003eembeddings\u003c/a\u003e generated by these models are often output in vector format. Working with vectors lets you implement things like similarity search across these embeddings. See our \u003ca href=\"https://blog.heroku.com/pgvector-launch#understanding-pgvector-and-its-significance\"\u003elaunch blog\u003c/a\u003e for more background into what pgvector is, its significance, and ideas for how to use this new data type.\u003c/p\u003e\n\u003ch2 class='anchored'\u003e\n  \u003ca name='an-example-word-vector-similarity-search' href='#an-example-word-vector-similarity-search'\u003eAn Example: Word Vector Similarity Search\u003c/a\u003e\n\u003c/h2\u003e\n\n\u003cp\u003eTo show a simple example of how to generate and save vector data to your Heroku database, I\u0026#39;m using the \u003ca href=\"https://wikipedia2vec.github.io/wikipedia2vec/\"\u003eWikipedia2Vec\u003c/a\u003e pretrained embeddings. However, you can train your own embeddings or use other models providing embeddings via API, like \u003ca href=\"https://huggingface.co/blog/getting-started-with-embeddings\"\u003eHuggingFace\u003c/a\u003e or \u003ca href=\"https://openai.com/\"\u003eOpenAI\u003c/a\u003e. The model you want to use depends on the type of data you\u0026#39;re working with. There are models for tasks like computing sentence similarities, searching large texts, or performing image classification. 
Wikipedia2Vec uses a \u003ca href=\"https://en.wikipedia.org/wiki/Word2vec\"\u003eWord2vec\u003c/a\u003e algorithm to generate vectors for individual words, which maps similar words close to each other in a continuous vector space. \u003c/p\u003e\n\n\u003cp\u003eI like animals, so I want to use Wikipedia2Vec to group similar animals. I’m using the vector embeddings of each animal and the distance between them to find animals that are alike.\u003c/p\u003e\n\n\u003cp\u003eIf I want to get the embedding for a word from Wikipedia2Vec, I need to use a model. I downloaded one from the \u003ca href=\"https://wikipedia2vec.github.io/wikipedia2vec/pretrained/\"\u003epretrained embeddings\u003c/a\u003e on their website. Then I can use their Python module and the function \u003ccode\u003eget_word_vector\u003c/code\u003e as follows:\u003c/p\u003e\n\n\u003cpre\u003e\u003ccode\u003efrom wikipedia2vec import Wikipedia2Vec\nwiki2vec = Wikipedia2Vec.load(\u0026#39;enwiki_20180420_100d.pkl\u0026#39;)\nwiki2vec.get_word_vector(\u0026#39;llama\u0026#39;)\n\u003c/code\u003e\u003c/pre\u003e\n\n\u003cp\u003eThe output of the vector looks like this:\u003c/p\u003e\n\n\u003cpre\u003e\u003ccode\u003ememmap([-0.15647224,  0.04055957,  0.48439676, -0.22689971, -0.04544162,\n        -0.06538601,  0.22609918, -0.26075622, -0.7195759 , -0.24022003,\n         0.1050799 , -0.5550985 ,  0.4054564 ,  0.14180332,  0.19856507,\n         0.09962048,  0.38372937, -1.1912689 , -0.93939453, -0.28067762,\n         0.04410955,  0.43394643, -0.3429818 ,  0.22209083, -0.46317756,\n        -0.18109794,  0.2775289 , -0.21939017, -0.27015808,  0.72002393,\n        -0.01586861, -0.23480305,  0.365697  ,  0.61743397, -0.07460125,\n        -0.10441436, -0.6537417 ,  0.01339269,  0.06189647, -0.17747395,\n         0.2669941 , -0.03428648, -0.8533792 , -0.09588563, -0.7616592 ,\n        -0.11528812, -0.07127796,  0.28456485, -0.12986512, -0.8063386 ,\n        -0.04875885, -0.27353695, -0.32921   , -0.03807172,  
0.10544889,\n         0.49989182, -0.03783042, -0.37752548, -0.19257008,  0.06255971,\n         0.25994852, -0.81092316, -0.15077794,  0.00658835,  0.02033841,\n        -0.32411653, -0.03033727, -0.64633304, -0.43443972, -0.30764043,\n        -0.11036412,  0.04134453, -0.26934972, -0.0289086 , -0.50319433,\n        -0.0204528 , -0.00278326,  0.36589545,  0.5446438 , -0.10852882,\n         0.09699931, -0.01168614,  0.08618425, -0.28925297, -0.25445923,\n         0.63120073,  0.52186656,  0.3439454 ,  0.6686451 ,  0.1076297 ,\n        -0.34688494,  0.05976971, -0.3720558 ,  0.20328045, -0.485623  ,\n        -0.2222396 , -0.22480975,  0.4386788 , -0.7506131 ,  0.14270408],\n       dtype=float32)\n\u003c/code\u003e\u003c/pre\u003e\n\n\u003cp\u003eTo get your vector data into your database:\u003c/p\u003e\n\n\u003col\u003e\n\u003cli\u003eGenerate the embeddings.\u003c/li\u003e\n\u003cli\u003eAdd a column to your database to store your embeddings.\u003c/li\u003e\n\u003cli\u003eSave the embeddings to the database.\u003c/li\u003e\n\u003c/ol\u003e\n\n\u003cp\u003eI already have the embeddings from Wikipedia2Vec, so let’s walk through preparing my database and saving them. When creating a vector column, it\u0026#39;s necessary to declare a length for it, so check and see the length of the embedding the model outputs. In my case, the embeddings are 100 numbers long, so I add that column to my table.\u003c/p\u003e\n\n\u003cpre\u003e\u003ccode\u003eCREATE TABLE animals(id serial PRIMARY KEY, name VARCHAR(100), embedding VECTOR(100));\n\u003c/code\u003e\u003c/pre\u003e\n\n\u003cp\u003eFrom there, save the items you\u0026#39;re interested in to your database. 
You can do it directly in SQL:\u003c/p\u003e\n\n\u003cpre\u003e\u003ccode\u003eINSERT INTO animals(name, embedding) VALUES (\u0026#39;llama\u0026#39;, \u0026#39;[-0.15647223591804504, \n…\n-0.7506130933761597, 0.1427040845155716]\u0026#39;);\n\u003c/code\u003e\u003c/pre\u003e\n\n\u003cp\u003eBut you can also use your \u003ca href=\"https://devcenter.heroku.com/articles/connecting-heroku-postgres\"\u003efavorite programming language\u003c/a\u003e along with a Postgres client and a \u003ca href=\"https://github.com/pgvector/pgvector#languages\"\u003epgvector library\u003c/a\u003e. For this example, I used Python, \u003ca href=\"https://github.com/psycopg/psycopg\"\u003epsycopg\u003c/a\u003e, and \u003ca href=\"https://github.com/pgvector/pgvector-python\"\u003epgvector-python\u003c/a\u003e. Here I\u0026#39;m using the pretrained embedding file to generate embeddings for a list of animals I made, \u003ccode\u003evaleries-animals.txt\u003c/code\u003e, and saving them to my database.\u003c/p\u003e\n\n\u003cpre\u003e\u003ccode\u003eimport os\nimport psycopg\nfrom pathlib import Path\nfrom pgvector.psycopg import register_vector\nfrom wikipedia2vec import Wikipedia2Vec\n\n# Read the connection string from the DATABASE_URL environment variable\n# (Heroku Postgres sets this config var on the app)\nDATABASE_URL = os.environ[\u0026#39;DATABASE_URL\u0026#39;]\n\nwiki2vec = Wikipedia2Vec.load(\u0026#39;enwiki_20180420_100d.pkl\u0026#39;)\n# One animal per line; skip blank lines such as a trailing newline\nanimals = [line.strip() for line in Path(\u0026#39;valeries-animals.txt\u0026#39;).read_text().splitlines() if line.strip()]\n\nwith psycopg.connect(DATABASE_URL, sslmode=\u0026#39;require\u0026#39;, autocommit=True) as conn:\n    # Register the vector type so psycopg can adapt NumPy arrays\n    register_vector(conn)\n    cur = conn.cursor()\n    for animal in animals:\n        cur.execute(\u0026quot;INSERT INTO animals(name, embedding) VALUES (%s, %s)\u0026quot;, (animal, wiki2vec.get_word_vector(animal)))\n\u003c/code\u003e\u003c/pre\u003e\n\n\u003cp\u003eNow that I have the embeddings in my database, I can use pgvector\u0026#39;s functions to query them. 
The extension includes operators for Euclidean distance (\u003ccode\u003e\u0026lt;-\u0026gt;\u003c/code\u003e), cosine distance (\u003ccode\u003e\u0026lt;=\u0026gt;\u003c/code\u003e), and inner product (\u003ccode\u003e\u0026lt;#\u0026gt;\u003c/code\u003e, which returns the negative inner product). You can use all three for \u003ca href=\"https://developers.google.com/machine-learning/clustering/similarity/measuring-similarity\"\u003ecalculating similarity\u003c/a\u003e between vectors. Which one you use depends on \u003ca href=\"https://cmry.github.io/notes/euclidean-v-cosine\"\u003eyour data as well as your use case\u003c/a\u003e.\u003c/p\u003e\n\n\u003cp\u003eHere I\u0026#39;m using Euclidean distance to find the five animals closest to a shark:\u003c/p\u003e\n\n\u003cpre\u003e\u003ccode\u003e=\u0026gt; SELECT name FROM animals WHERE name != \u0026#39;shark\u0026#39; ORDER BY embedding \u0026lt;-\u0026gt; (SELECT embedding FROM animals WHERE name = \u0026#39;shark\u0026#39;) LIMIT 5;\n name \n-----------\n crocodile\n dolphin\n whale\n turtle\n alligator\n(5 rows)\n\u003c/code\u003e\u003c/pre\u003e\n\n\u003cp\u003eIt works! It\u0026#39;s worth noting that the model I used is based on words appearing together in Wikipedia articles, and different models or source corpora likely yield different results. The results here are also limited to the hundred or so animals that I added to my database.\u003c/p\u003e\n\u003ch2 class='anchored'\u003e\n  \u003ca name='pgvector-optimization-and-performance-considerations' href='#pgvector-optimization-and-performance-considerations'\u003epgvector Optimization and Performance Considerations\u003c/a\u003e\n\u003c/h2\u003e\n\n\u003cp\u003eAs you add more vector data to your database, you may notice performance issues or slowness in performing queries. 
You can index vector data like other columns in Postgres, and pgvector provides a few ways to do so, but there are some important considerations to keep in mind:\u003c/p\u003e\n\n\u003cul\u003e\n\u003cli\u003eAdding an index causes pgvector to switch to using approximate nearest neighbor search instead of exact nearest neighbor search, possibly causing a difference in query results.\u003c/li\u003e\n\u003cli\u003eIndexing functions are based on distance calculations, so create one based on the calculation you plan to rely on the most in your application.\u003c/li\u003e\n\u003cli\u003eThere are two index types supported, IVFFlat and HNSW. Before you add an IVFFlat index, make sure you have some data in your table for better recall.\u003c/li\u003e\n\u003c/ul\u003e\n\n\u003cp\u003eCheck out the \u003ca href=\"https://github.com/pgvector/pgvector#indexing\"\u003epgvector documentation\u003c/a\u003e for more information on indexing and other performance considerations.\u003c/p\u003e\n\u003ch2 class='anchored'\u003e\n  \u003ca name='collaborate-and-share-your-pgvector-projects' href='#collaborate-and-share-your-pgvector-projects'\u003eCollaborate and Share Your pgvector Projects\u003c/a\u003e\n\u003c/h2\u003e\n\n\u003cp\u003eNow that pgvector for Heroku Postgres is out in the world, we\u0026#39;re really excited to hear what you do with it! One of pgvector\u0026#39;s great advantages is that it lets vector data live alongside all the other data you might already have in Postgres. You can add an embedding column to your existing tables and start experimenting. Our \u003ca href=\"https://blog.heroku.com/pgvector-launch\"\u003elaunch blog\u003c/a\u003e for this feature includes a lot of ideas and possible use cases for how to use this new tool, and I\u0026#39;m sure you can come up with many more. If you have questions, our \u003ca href=\"https://help.heroku.com/\"\u003eSupport team\u003c/a\u003e is available to assist. 
Don\u0026#39;t forget you can share your solutions using the \u003ca href=\"https://devcenter.heroku.com/articles/heroku-button\"\u003eHeroku Button\u003c/a\u003e on your repo. If you feel like blogging on your success, tag us on social media and we would love to read about it!\u003c/p\u003e\n","published_at":"2023-11-15T18:42:51.442Z","permalink":"https://blog.heroku.com/pgvector-for-similarity-search-on-heroku-postgres","tags":[],"summary":"Introducing pgvector for Heroku Postgres. The extension adds the vector data type to Heroku Postgres along with additional functions to work with it."},{"title":"Router 2.0: The Road to Beta","content":"\u003cp\u003eLast month, Heroku announced the \u003ca href=\"https://devcenter.heroku.com/changelog-items/2682\"\u003ebeta release of Router 2.0\u003c/a\u003e, the new Common Runtime router! \u003c/p\u003e\n\n\u003cp\u003eAs part of our commitment to infrastructure modernization, Heroku is making upgrades to the \u003ca href=\"https://devcenter.heroku.com/articles/dyno-runtime#common-runtime\"\u003eCommon Runtime\u003c/a\u003e routing layer. The \u003ca href=\"https://devcenter.heroku.com/changelog-items/2682\"\u003ebeta release of Router 2.0\u003c/a\u003e is an important step along this journey. We’re excited to give you an inside look at all we’ve been doing to get here. \u003c/p\u003e\n\n\u003cp\u003eIn both the Common Runtime and \u003ca href=\"https://devcenter.heroku.com/articles/dyno-runtime#private-spaces-runtime\"\u003ePrivate Spaces\u003c/a\u003e, the \u003ca href=\"https://devcenter.heroku.com/articles/how-heroku-works#http-routing\"\u003eHeroku router\u003c/a\u003e is responsible for serving requests to customers’ web dynos. In 2024, Router 2.0 will replace the existing Common Runtime router. We’re being transparent about this project so that you, our customers, are motivated to try out Router 2.0 now, while it’s in beta. 
As an early adopter, you can help us validate that things are working as they should, particularly for \u003cem\u003eyour\u003c/em\u003e apps and \u003cem\u003eyour\u003c/em\u003e use cases. You’ll also be first in line to try out the new features we’re planning to add, like \u003ca href=\"https://github.com/heroku/roadmap/issues/34\"\u003eHTTP/2\u003c/a\u003e.\u003c/p\u003e\n\u003ch2 class='anchored'\u003e\n  \u003ca name='why-a-new-router' href='#why-a-new-router'\u003eWhy a New Router?\u003c/a\u003e\n\u003c/h2\u003e\n\n\u003cp\u003eNow, you may be asking, why build a new router instead of improving the existing one? Our primary motivator has been faster and safer delivery of new routing features for our customers. For a couple of reasons, this has been difficult to achieve with the Common Runtime’s legacy routing layer.\u003c/p\u003e\n\n\u003cp\u003eThe current Common Runtime router is written in Erlang. It’s built around a \u003ca href=\"https://blog.heroku.com/vegur-free-software\"\u003ecustom HTTP server library\u003c/a\u003e that supports Heroku-specific features, such as \u003ca href=\"https://devcenter.heroku.com/articles/error-codes\"\u003eH-codes\u003c/a\u003e, \u003ca href=\"https://devcenter.heroku.com/articles/eco-dyno-hours#dyno-sleeping\"\u003edyno sleeping\u003c/a\u003e, and \u003ca href=\"https://devcenter.heroku.com/articles/http-routing#heroku-router-log-format\"\u003erouter logs\u003c/a\u003e. For over 10 years, this router, dubbed “Hermes” internally, has served all requests to Heroku’s Common Runtime. At the time of Hermes’ launch, Erlang was an appropriate choice since the language places emphasis on concurrency, scalability, and fault tolerance. In addition, Erlang offers a powerful process introspection toolchain that has served our networking engineers well when \u003ca href=\"https://blog.heroku.com/erlang-in-anger\"\u003edebugging in-memory state issues\u003c/a\u003e. 
Our engineers embraced the language fully, also choosing to write the previous version of our logging system, \u003ca href=\"https://blog.heroku.com/logging-on-heroku\"\u003eLogplex\u003c/a\u003e, in Erlang. \u003c/p\u003e\n\n\u003cp\u003eHowever, as the years passed, development on the Hermes codebase proved difficult. The popularity of Erlang within Heroku began to taper off. The open-source and internal libraries that Hermes depends on stopped receiving the volume of contributions they once had. Productivity declined due to these factors, making significant router upgrades risky. After a few failed upgrade attempts, our team decided to pin the software versions of relevant Erlang components. This action wasn’t without trade-offs. Being pinned to an old version of Erlang became a blocker to delivering now-commonplace features like \u003ca href=\"https://github.com/heroku/roadmap/issues/34\"\u003eHTTP/2\u003c/a\u003e. Thus, we decided to put Hermes into maintenance mode and focus on its replacement.\u003c/p\u003e\n\u003ch2 class='anchored'\u003e\n  \u003ca name='choosing-a-language' href='#choosing-a-language'\u003eChoosing a Language\u003c/a\u003e\n\u003c/h2\u003e\n\n\u003cp\u003eBefore kicking off design sessions, our team discussed what broader goals we had for the replacement. In establishing our priorities, the team came to a consensus around three main goals:\u003c/p\u003e\n\n\u003cul\u003e\n\u003cli\u003e\u003cstrong\u003eWrite the router in a language everyone on our team knows well.\u003c/strong\u003e With Erlang knowledge limited to just a couple of engineers on the team, we wanted to rewrite the router in a different language. That language had to be something our team already knew well.\u003c/li\u003e\n\u003cli\u003e\u003cstrong\u003eWrite the router in a language with a strong open-source community.\u003c/strong\u003e A robust community unlocks the ability to quickly adopt new specs, write features, fix bugs, and respond to CVEs. 
It also expands the candidate pool when it comes time to hire new engineers.\u003c/li\u003e\n\u003cli\u003e\u003cstrong\u003eShare as much code as possible between the Common Runtime and Private Spaces routers.\u003c/strong\u003e Since the Common Runtime and Private Spaces routers share most of the same features, there’s no reason for the codebases to differ much. Additionally, it’s faster and easier to deliver a feature if we only have to write it once.\u003c/li\u003e\n\u003c/ul\u003e\n\n\u003cp\u003eWith these goals in mind, the language to choose for Router 2.0 was clear — Go.\u003c/p\u003e\n\n\u003cp\u003eNot only is the Private Spaces router already written in Go, but the language has become our standard choice for developing new components of Heroku’s runtime. This story isn’t at all unique. Many others in the DevOps and cloud hosting world today have chosen Go for its performance, built-in concurrency handling, automatic garbage collection — the list goes on. Simply put, it’s a language designed specifically for building big dynamic distributed systems. Because of these factors, the Go community outside and within Heroku has flourished, with Go expertise in abundance across our runtime engineering teams.\u003c/p\u003e\n\n\u003cp\u003eToday, by writing Router 2.0 in Go, we’re creating a piece of software to which everyone on our team can contribute. Furthermore, by doubling down on the language of the Private Spaces router, we unify the source code and routing behavior of these two products. Historically, these codebases have been entirely distinct, meaning that any implementation our engineers introduce must be written twice. To combat this, we’ve extracted the common functionality of the two routers into an internal HTTP library. 
With a unified codebase, the delivery of features and fixes becomes faster and simpler, reducing the cognitive burden on our engineers who operate and maintain the routers.\u003c/p\u003e\n\n\u003cp\u003eDeveloping the router is only half the story, though. The other half is about introducing this service to the world as safely and seamlessly as possible.\u003c/p\u003e\n\u003ch2 class='anchored'\u003e\n  \u003ca name='architecture' href='#architecture'\u003eArchitecture\u003c/a\u003e\n\u003c/h2\u003e\n\n\u003cp\u003eYou may recall that back in 2021, Heroku announced the completion of an \u003ca href=\"https://blog.heroku.com/faster-dynos-for-all\"\u003einfrastructure upgrade\u003c/a\u003e to the Common Runtime that brought customers better-performing dynos and lower request latencies. This upgrade involved an extensive migration from our old, “classic” cloud environment to our more performant and secure “sharded” environment. We wanted to complete this migration without disrupting any active traffic or asking customers to change their DNS setups. To do this, our engineers put an \u003ca href=\"https://www.geeksforgeeks.org/open-systems-interconnection-model-osi/\"\u003eL4\u003c/a\u003e reverse proxy in front of Hermes, straddling the classic and sharded environments. The idea was to slowly shift traffic over to the sharded environments, with the L4 proxy splitting connections to both the classic and the new “in-shard” Hermes instances.\u003c/p\u003e\n\n\u003cp\u003eAs part of this migration, TLS termination on custom domains also moved from Hermes to the L4 proxy.\u003c/p\u003e\n\n\u003cp\u003e\u003cimg src=\"https://heroku-blog-files.s3.amazonaws.com/posts/1698271166-IMG_2180.jpeg\" alt=\"IMG_2180\"\u003e\nThis L4 proxy is the component that has formed the basis for Router 2.0. Over the past year, our networking team has been developing an L7 router to sit in-memory behind the L4 proxy. 
Today, the L4 proxy + Router 2.0 process runs alongside Hermes, communicating over the \u003ccode\u003elocalhost\u003c/code\u003e network on our router instances. Putting these two processes side by side, instead of on separate hosts, means we limit the number of network hops between clients and backend dynos.\u003c/p\u003e\n\u003ch3 class='anchored'\u003e\n  \u003ca name='the-strangler-pattern' href='#the-strangler-pattern'\u003eThe Strangler Pattern\u003c/a\u003e\n\u003c/h3\u003e\n\n\u003cp\u003eFor apps still on the default routing path, client connections are established with the L4 proxy, which directs traffic through Hermes.\n\u003cimg src=\"https://heroku-blog-files.s3.amazonaws.com/posts/1698271115-IMG_2488.jpeg\" alt=\"IMG_2488\"\u003e\nWhen an \u003ca href=\"https://devcenter.heroku.com/articles/heroku-runtime-router-2-0#enable-router-2-0\"\u003eapp has Router 2.0 enabled\u003c/a\u003e, the L4 proxy instead funnels traffic over an in-memory listener to Router 2.0, then out to the app’s web dynos. Hermes is cut out of the network path.\n\u003cimg src=\"https://heroku-blog-files.s3.amazonaws.com/posts/1698339992-IMG_5679.jpeg\" alt=\"IMG_5679\"\u003e\nThis sort of architecture has a particular name — the “\u003ca href=\"https://www.redhat.com/architect/pros-and-cons-strangler-architecture-pattern\"\u003eStrangler pattern\u003c/a\u003e” — and it involves inserting a form of middleman between clients and the old system you want to replace. The middleman directs traffic, dividing it between the old system and a new system that is built out incrementally. The major advantage of such a setup is that “big bang” changes or “all-at-once” cut-overs are completely avoided. However, both the old and the new systems live on the same production hot path while the development of the new system is in progress. What has this meant for Router 2.0? 
Well, we had to lay a complete production-ready foundation early on.\u003c/p\u003e\n\u003ch2 class='anchored'\u003e\n  \u003ca name='living-on-the-hot-path' href='#living-on-the-hot-path'\u003eLiving on the Hot Path\u003c/a\u003e\n\u003c/h2\u003e\n\n\u003cp\u003eHeroku has always been an opinionated hosting and deployment platform that caters to general use cases. In our products, we optimize for stability while delivering innovation. Within the framing of Router 2.0, this commitment to stability meant we had to do a few things \u003cem\u003ebefore\u003c/em\u003e releasing the beta.\u003c/p\u003e\n\u003ch3 class='anchored'\u003e\n  \u003ca name='automate-router-deployments' href='#automate-router-deployments'\u003eAutomate Router Deployments\u003c/a\u003e\n\u003c/h3\u003e\n\n\u003cp\u003eUp until recently, deploying Router 2.0 meant creating a new release and manually triggering router fleet cycles across all our production clouds. This process was tedious, time-consuming, and error-prone. We fixed this by building out an automation pipeline, outfitted with gates on availability metrics, performance metrics, and smoke tests. Any time a router release fails even one of these health indicators, it doesn’t advance to the next stage of deployment.\u003c/p\u003e\n\u003ch3 class='anchored'\u003e\n  \u003ca name='load-test-continuously' href='#load-test-continuously'\u003eLoad Test Continuously\u003c/a\u003e\n\u003c/h3\u003e\n\n\u003cp\u003eAn important aspect of vetting the new sharded environments in 2021 was load testing the L4 proxy/Hermes combo. At the time, this was a significant manual undertaking. After manually running these tests, it became obvious that we would need a more practical load testing story while developing Router 2.0. In response, we built a load testing system to continuously push our staging routers to their limits and trigger scaling policies, so that we can also validate our autoscaling setup. 
This framework has been immensely valuable for Router 2.0 development, catching bugs and regressions before they ever hit production. The results of these load tests feed right back into our deployment pipeline, blocking any deploys that don’t live up to our internal service level objectives.\u003c/p\u003e\n\u003ch3 class='anchored'\u003e\n  \u003ca name='introduce-network-error-logging' href='#introduce-network-error-logging'\u003eIntroduce Network Error Logging\u003c/a\u003e\n\u003c/h3\u003e\n\n\u003cp\u003eTraditionally, routing health has been measured through the use of “checkee” apps. These are web-server applications that we deploy across our production Common Runtime clouds and constantly probe from corresponding “checker” apps that run in Private Spaces. The checker-checkee duo allows us to mimic and measure our customers’ routing experience. In recent years, the gaps in this model have become more apparent. Namely, our checkees only represent the tiniest fraction of traffic pumping through the router at any given time. In addition, our checkers can’t possibly account for all the various client types and configurations that may be used to connect to the platform.\u003c/p\u003e\n\n\u003cp\u003eTo address the gap, we introduced \u003ca href=\"https://devcenter.heroku.com/changelog-items/2678\"\u003eNetwork Error Logging\u003c/a\u003e (NEL) to both Hermes and Router 2.0. It’s an experimental \u003ca href=\"https://www.w3.org/\"\u003eW3C\u003c/a\u003e standard that enables the measurement of routing layer performance by collecting real-time data about network failures from web browsers. Google Chrome, Microsoft Edge, and certain mobile clients already \u003ca href=\"https://developer.mozilla.org/en-US/docs/Web/HTTP/Headers/NEL#browser_compatibility\"\u003esupport\u003c/a\u003e the spec. 
NEL ensures our engineers maintain a more holistic understanding of the routing experience actually felt by clients.\u003c/p\u003e\n\u003ch2 class='anchored'\u003e\n  \u003ca name='the-future' href='#the-future'\u003eThe Future\u003c/a\u003e\n\u003c/h2\u003e\n\n\u003cp\u003eCompletely retiring Hermes will take time. We’re only at the end of the beginning of that journey. As detailed in the \u003ca href=\"https://devcenter.heroku.com/articles/heroku-runtime-router-2-0\"\u003eDev Center article\u003c/a\u003e, Router 2.0 isn’t complete yet because it doesn’t support the full list of features on our \u003ca href=\"https://devcenter.heroku.com/articles/http-routing\"\u003eHTTP Routing\u003c/a\u003e page. We’re working on it. We’ll soon be adding \u003ca href=\"https://github.com/heroku/roadmap/issues/34\"\u003eHTTP/2 support\u003c/a\u003e, one of the most requested features, to both the Common Runtime and Private Spaces. However, in the Common Runtime, HTTP/2 will only be available when your app is using Router 2.0.\u003c/p\u003e\n\n\u003cp\u003eOur aim is to achieve feature parity with Hermes, plus a little more, over the next few months. Once we’re there, we’ll focus on a migration plan that involves flagging apps into Router 2.0 automatically. Much like in the migration from classic environments to sharded environments, we’ll break the process out into phases based on small batches of apps in similar \u003ca href=\"https://devcenter.heroku.com/articles/dyno-types#dyno-tiers-and-mixing-dyno-types\"\u003edyno tiers\u003c/a\u003e. This approach gives us time to pause between phases and assess the performance of the new system.\u003c/p\u003e\n\u003ch2 class='anchored'\u003e\n  \u003ca name='participating' href='#participating'\u003eParticipating\u003c/a\u003e\n\u003c/h2\u003e\n\n\u003cp\u003eWe hope that you, our customers, can help us validate the new router well before it becomes the default. 
You can enable Router 2.0 for a Common Runtime app by running:\u003c/p\u003e\n\n\u003cp\u003e\u003ccode\u003eheroku labs:enable http-routing-2-dot-0 -a \u0026lt;app\u0026gt;\u003c/code\u003e\u003c/p\u003e\n\n\u003cp\u003eIf you choose to enroll, you can submit feedback by commenting on the \u003ca href=\"https://github.com/heroku/roadmap/issues/219\"\u003eHeroku Public Roadmap item\u003c/a\u003e or \u003ca href=\"https://help.heroku.com/tickets/new?id=4\"\u003ecreating a support ticket\u003c/a\u003e.\u003c/p\u003e\n\u003ch2 class='anchored'\u003e\n  \u003ca name='conclusion' href='#conclusion'\u003eConclusion\u003c/a\u003e\n\u003c/h2\u003e\n\n\u003cp\u003eDelivering new features to a platform like Heroku is never as simple as flipping an on/off switch. When we deliver something to our customers, there’s always a mountain of behind-the-scenes effort put into it. Simply stated, we write a lot of software to ensure the software that you see works the way it should.\u003c/p\u003e\n\n\u003cp\u003eWe’re proud of the work we’ve done so far on Router 2.0, and we’re excited for what’s coming next. If you enroll your applications in the beta, keep an eye on the \u003ca href=\"https://devcenter.heroku.com/articles/heroku-runtime-router-2-0\"\u003eRouter 2.0 Dev Center\u003c/a\u003e page and the \u003ca href=\"https://devcenter.heroku.com/changelog\"\u003eHeroku Changelog\u003c/a\u003e. We’ll be posting updates about new features as they become available.\u003c/p\u003e\n\n\u003cp\u003eThanks for reading and happy coding!\u003c/p\u003e\n","published_at":"2023-10-30T17:00:00.000Z","permalink":"https://blog.heroku.com/router-2dot0-the-road-to-beta","tags":[],"summary":"As part of our commitment to infrastructure modernization, Heroku is upgrading the Common Runtime routing layer with the beta release of Router 2.0."}]