Home Artificial Intelligence Auto-GPT: Towards Artificial Intelligence With Intelligent Agents Background: Intelligent Agents Introducing Auto-GPT: GPT going autonomously. A Concrete Use Case Technicalities A Field Test Example 1: Entrepreneur-GPT Example 2: Market Research Example 3: Planner Summary

Auto-GPT: Towards Artificial Intelligence With Intelligent Agents Background: Intelligent Agents Introducing Auto-GPT: GPT going autonomously. A Concrete Use Case Technicalities A Field Test Example 1: Entrepreneur-GPT Example 2: Market Research Example 3: Planner Summary

Auto-GPT: Towards Artificial Intelligence With Intelligent Agents
Background: Intelligent Agents
Introducing Auto-GPT: GPT going autonomously.
A Concrete Use Case
A Field Test
Example 1: Entrepreneur-GPT
Example 2: Market Research
Example 3: Planner

Photo by Rock’n Roll Monkey on Unsplash

Auto-GPT is an experimental open-source try and make GPT-4, the core language model of the most recent ChatGPT, autonomous. Test cases include business management and development to extend net price. Not surprisingly, the project’s popularity quickly skyrocketed. In this text, we give some relevant background about intelligent agents within the context of artificial intelligence (AI), technicalities of the bot and dive deeper into what the bot currently can do with three examples.

If you might have a discussion about Artificial Intelligence (AI) as of late, it is going to probably be dominated by Deep Learning (DL) and its latest and biggest models and up to date applications. Sometimes, DL and AI are used so interchangeably that one could assume that DL is AI. Nonetheless, should you start with a scholarly approach, there may be a transparent distinction. Take the book Artificial Intelligence: A Modern Approach, the most-used AI textbook from the AI grandmasters Stuart Russel and Peter Norvig; one in every of the primary larger topics you’ll find out about is .

For those who don’t have time to read through this AI masterpiece, we will start off by introducing a definition from Wikipedia.

In artificial intelligence, an () is anything which perceives its environment, takes actions autonomously with the intention to achieve goals, and will improve its performance with learning or acquiring knowledge. They could be easy or complex — a thermostat or other control system is taken into account an example of an intelligent agent, as is a human being, as is any system that meets the definition, equivalent to a firm, a state, or a biome.

How would this definition fit something like ChatGPT or its underlying language model GPT-4? Possibly you may argue that ChatGPT’s goal is to be the perfect conversation bot, and it decides autonomously what it is going to inform you. We also can ask GPT-4, whether it considers itself to be an intelligent agent:

As an AI language model, ChatGPT might be considered an intelligent agent. An intelligent agent is a system that may perceive its environment, reason about it, and take actions based on its understanding to attain specific goals or objectives. ChatGPT can perceive and process natural language inputs, reason about them based on its extensive training data, and generate appropriate responses or perform specific tasks.

Nonetheless, how is that this autonomous? Here’s an abstract from the response when questioning autonomy:

Nonetheless, the autonomy of ChatGPT is proscribed by its narrow give attention to language processing and understanding. It might probably only “make decisions” inside the scope of generating relevant text based on input, and it doesn’t possess the flexibility to independently adapt or learn beyond its training.

So the important thing query is: How can ChatGPT break free from the boundaries of its training?

One approach for that is Auto-GPT by Toran Bruce Richards. By now, it’s one in every of the fastest growing packages on GitHub whenever you measure it by popularity: Being roughly one month old, it has already gained 33.5 k stars, had greater than 1000 pull requests, and 85 contributors. It gained tremendous traction amongst various outlets. Cosmopolitan called it the “latest player in AI’s race for glory”. A modified version termed ChaosGPT was even programmed to “destroy humanity with AI”.

Twitter appraisals are similarly enthusiastic, calling it “Probably the most groundbreaking AI development no person’s talking about”. The previous director of AI from Tesla, Andrej Karpathy, called it the Next frontier of prompt engineering imo”.

To present you a glimpse of what Auto-GPT can do, we will check the Tweet of Twitter User SullyOmarr. Auto-GPT was tasked to do market research with it, getting five competitors and generating a report:

It went on Google to research the data and gave an in depth report with pros, cons, and a conclusion. Every little thing is in eight minutes and for a value of 10 cents. This sparks a whole lot of interest.

So, what does Auto-GPT do? It’s written in Python and mainly consists of the next functions:

  • A capability to access the web to look and gather information. Underlying is an online scraper based on Beautiful Soup
  • Long-Term and Short-Term memory management. Here it could actually use vector databases based on Pinecone or Redis.
  • From OpenAI it uses GPT-4 for text generation and file storage and summarization with GPT-3.5

Internally, it configures agents that may receive messages, and they’re going to return a response.
On top of it, you possibly can configure the AI to read out the text.

To run Auto-GPT, the minimum requirements are Python3.8 or later, an OpenAI API key, and a PINECONE API key. The instructions on the GitHub are straightforward to follow.

Once I first tested things, they didn’t work, and other users were already quick to create a problem for it. A few commits later, things worked again. The event pace is incredibly fast, so it might be that the codebase changes in a few minutes. For testing, I used this commit: a0f900f , so when you cloned the repo, you may take a look at this state with git checkout a0f900f for consistency.

Next, you will want to define your bot in ai-settings.yaml.

To start, I kept the preconfigured Entrepreneuer-GPT, which is imagined to increase net price and develop and manage multiple businesses.

It began with making a plan to estimate my current net price and give you a business technique to increase my net price. It continued to look the net and located a net price calculator, which — to not my surprise as I didn’t give financial data — did lead nowhere, so it continued to give you a plan to watch finance and news web sites for interesting investment opportunities. Now that is an interesting spin.

The subsequent iteration was to put in writing some Python code to interface the online price calculator. Why the online price calculator again? Arguably, that is where things appear to go flawed. But there may be some code. One other iteration to refactor the code. Two more iterations with seemingly no motion. Then back to the news web sites. It comes up with a plan to create an investment file that’s in a position to track information in a structured way. This sounds quite reasonable.

Nonetheless, I stop here: I see some excellent thoughts, but I can imagine that reaching the goal of making a tangible business this fashion might take some time.

Let’s try something easier. I made a rather modified version of SullyOmarrs AI definition, but this time to research GPUs, like so:

ai_name: ResearchGPT
ai_role: ai agent designed to autonomously create market research for a given idea
- Do market research for graphic cards
- get the highest 5 competitors and list their pros and cons
- include the value for every
- save the evaluation right into a txt-file
- once done, terminate

Fast forward a few iterations, and roughly ten minutes later: I see that the bot browses tech web sites equivalent to toms hardware, pc gamer, or tech radar, summarizing them, and I see the names of my favorite GPUs popping up. In the long run, the next file is written:

Top 5 Graphics Cards with the professionals and cons:

1. Nvidia GeForce RTX 3080
Pros: Fastest GPU available, Ray-tracing performance, and DLSS built-in.
Cons: Expensive, power-hungry
Price: $699

2. Nvidia GeForce RTX 3070
Pros: Great value, RTX and DLSS support.
Cons: Can run hot, not the fastest GPU.
Price: $499

3. Nvidia GeForce RTX 3070 Ti
Pros: Balanced performance, Ray-tracing ability, and DLSS support.
Cons: Expensive, as fast because the cheaper RTX 3080.
Price: $599

4. AMD Radeon RX 6900 XT
Pros: Great 4K and 1440p performance, Smart Access Memory enabled
Cons: Price, Power consumption.
Price: $999

5. AMD Radeon RX 6800 XT
Pros: Great speed for its price, supports Ray-tracing and DLSS.
Cons: Expensive, power-hungry.
Price: $649

Comparative Evaluation Report:
Based on the Graphics Card Market research, Nvidia GeForce RTX 3080 is the fastest graphics card available. Nonetheless, it's the costliest and power-hungry. Nvidia GeForce RTX 3070 is affordable with great value for a graphics card that supports RTX and DLSS. AMD Radeon RX 6900 XT is the fastest Radeon GPU available. Nevertheless, it's power-hungry and comes with a high price tag. AMD Radeon RX 6800 XT is the perfect value for money graphics card because it offers great performance at a reasonable price. At the identical time, it still manages to support Ray-tracing and DLSS.

This result may be very exciting. It gives a whole lot of good GPUs; the outline is generally good and limited to five GPUs. Every little thing was written to a file. Nonetheless, the top-tier GPU 4090 is missing, as also industry-grade GPUs equivalent to the A100. Overall, for GPUs, you’ll probably be higher off just a benchmark site equivalent to UserBenchmark. Nonetheless a really exciting result and one can get a glimpse of where that is going.

We create an AI that can help us to plan a marriage.

ai_name: PlannerGPT
ai_role: ai agent designed to prepare events
- plan a marriage in Recent York for 100 people
- make a plan with three different pricing tiers
- include venues and potential activities and include contact data
- include a timeline for every tier
- recommend a month for every tier
- save the evaluation to a file
- once done, terminate

It goes over several iterations and starts with the budget tier. Between the lines, I can see lists of wedding venues. A parsing error appears. While I initially manually confirmed each iteration, I now tell it to run multiple iterations without me checking each result. After roughly ten minutes, I get the next files, and the bot shuts down.

Wedding Plan Evaluation: 
- Identified potential activities for the marriage and categorize them into different pricing tiers.
- Created three different pricing tiers that cater to different budgets.
- Really useful the perfect month for every pricing tier based on vendor availability, cost of products, and client preferences.
- Created an in depth timeline for every event of the marriage day and shared it with the vendors and the marriage party.
Wedding Vendors' Information: 

**Premium Tier:**
1. The Plaza Hotel - 768 fifth Ave, Recent York, NY 10019 - Capability: As much as 500 guests - Pricing starts at $30,000 - Contact: (212) 759-3000
2. The St. Regis Recent York - 2 E fifty fifth St, Recent York, NY 10022 - Capability: As much as 350 guests - Pricing starts at $25,000 - Contact: (212) 753-4500
3. The Waldorf Astoria Recent York - 301 Park Ave, Recent York, NY 10022 - Capability: As much as 1,200 guests - Pricing starts at $50,000 - Contact: (212) 355-3000

**Mid-range Tier:**
1. Housing Works Bookstore Cafe - 126 Crosby St, Recent York, NY 10012 - Capability: As much as 250 guests - Pricing starts at $5,000 - Contact: (212) 334-3324
2. Brooklyn Winery - 213 N eighth St, Brooklyn, NY 11211 - Capability: As much as 200 guests - Pricing starts at $10,000 - Contact: (347) 763-1506
3. The Foundry - 42-38 ninth St, Long Island City, NY 11101 - Capability: As much as 200 guests - Pricing starts at $12,000 - Contact: (718) 786-7776

**Budget Tier:**
1. The Green Constructing - 452 Union St, Brooklyn, NY 11231 - Capability: As much as 250 guests - Pricing starts at $3,000 - Contact: (718) 522-3363
2. The DUMBO Loft - 155 Water St, Brooklyn, NY 11201 - Capability: As much as 200 guests - Pricing starts at $4,500 - Contact: (718) 310-3040
3. The Bronx Zoo - 2300 Southern Blvd, Bronx, NY 10460 - Capability: As much as 500 guests - Pricing starts at $6,000 - Contact: (718) 220-5100

**Entertainment Activities:**
1. DJ: Scratch Weddings - Pricing starts at $1,500 - Contact: (800) 570-5767
2. Live Music: Hank Lane Music - Pricing starts at $5,000 - Contact: (212) 767-0600
3. Photo Booth: Sweet Booths - Pricing starts at $750 - Contact: (646) 801-2807

- Premium tier costs between $50,000 - $75,000.
- Mid-range tier costs between $30,000 - $50,000.
- Budget tier costs between $15,000 - $20,000.
- Each tier will include different venues and activities, appealing to various kinds of budgets.

Again, the result looks very usable, and we achieved most of our goals. The timeline is missing, and we don’t have the perfect month for the event. One obvious query is that if there may be an excellent website on the market that has this compilation on the market already. Overall it is vitally enjoyable to see the bot pondering and spitting out results.

Auto-GPT with GPT-4 is one big step in the best direction for intelligent agents, and I can easily see how increasingly tasks will likely be automatable. It’s to notice that, in principle, I could have provided feedback to Auto-GPT for every iteration, but I did run it without human inference — autonomously. For this, the outcomes are absolutely remarkable.

Sometimes I could see coding errors, and it was in a position to fix itself.
While typing these lines, I ponder if Auto-GPT would have been able to put in writing this text. At this point, I don’t think so, but this probably won’t be for long.


Please enter your comment!
Please enter your name here