Home Artificial Intelligence 10 Exciting Project Ideas Using Large Language Models (LLMs) for Your Portfolio Projects Based on Text Generation Projects Based on Text Representation Advanced Projects (Project Idea 10: Where is Waldo) Summary Enjoyed This Story?

10 Exciting Project Ideas Using Large Language Models (LLMs) for Your Portfolio Projects Based on Text Generation Projects Based on Text Representation Advanced Projects (Project Idea 10: Where is Waldo) Summary Enjoyed This Story?

3
10 Exciting Project Ideas Using Large Language Models (LLMs) for Your Portfolio
Projects Based on Text Generation
Projects Based on Text Representation
Advanced Projects (Project Idea 10: Where is Waldo)
Summary
Enjoyed This Story?

Listed here are the rough steps you’ll follow to understand this project:

  1. Download the video or podcast transcript and cargo into documents
  2. Split long documents into chunks
  3. Summarize the transcript with an LLM
  4. Optional: Wrap all of it in a user-friendly command line interface and even an internet application.

Project Idea 4: Information Extraction

One other useful use case of LLMs is . For instance, you possibly can provide an LLM with a number of examples that contain text and the knowledge you would like it to extract.

Rember the cover letter generator from earlier? You possibly can extend it with a component to extract the relevant information from a job posting directly:

prompt = """
This program will extract relevant information from a job posting.
Listed here are some examples:

Job posting:
Lead engineer for software integration (distant possible)

At XYZ Co. we're making the world a greater place.
To achieve this we're on the lookout for a lead engineer with experience in Python and JIRA.

Extracted Text:
Role: Lead engieer for software integration.
Company: XYZ Co.
Requirements: Python, JIRA
--
Job posting:
Senior software engineer - Autonomous Mobility

ABC Inc. is an amazing company.
We're on the lookout for someone with great ability to jot down complex C code.

Extracted Text:
"""

Listed here are the rough steps you’ll follow to understand this project:

  1. Load job description from job posting right into a document
  2. Extract the relevant information with the LLM by prompt engineering a prompt using examples

Project Idea 5: Web Scraper

LLMs are exceptional at (transforming) texts, corresponding to

  • rewriting text in a selected style (e.g., the type of “The Economist” or “Latest Yorker”)
  • rewriting text in a selected reading level (e.g., level grade 6 for easier readability)
  • reformatting information from any format to some other format
  • text correction (e.g., spelling and grammar)
  • translations

It is vitally common to make use of LLMs to convert text from one form to a different.

A creative idea to use this rewriting capability is to make use of it for web scraping. If you might have ever written an internet scraper, you recognize how tedious it’s. What if you happen to could use LLMs to construct a more generic solution to extract data from unstructured web sites?

This is precisely what mangotree has done:

Listed here are the rough steps you’ll follow to understand this project:

  1. Scrape the web site’s source code and cargo right into a document
  2. Split long documents into chunks
  3. Extract the relevant data from the source code using the LLM (see extraction)
  4. Reformat the extracted data into the specified format with the LLM by prompt engineering a prompt using examples

The project ideas up to now were based on the thought of generating latest text. But one other use case of LLMs relies on the thought of text representations. You’ll be able to input the text to an embeddings model and extract the numerical representation of this text — the “text embeddings”.

These text embeddings enable you to perform mathematical operations, including similarity calculations, or apply Machine Learning algorithms.

On this section, we’ll discuss some project ideas based on use cases related to them:

  1. Search and similarity: searchable database of your documents
  2. Query answering: query answering over documents or code base
  3. Clustering: clustering social media posts and podcast episodes into topics
  4. Classification: classify business inquiries from e-mails

Project Idea 6: Searchable Database of Your Documents

Embeddings will help us seek for content based on . In contrast to keyword-based search engines like google, we will calculate the similarity of a document’s embeddings to the embeddings of a search query.

For instance, you possibly can turn your personal documents right into a searchable database:

One other neat project is Andrej Karpathy’s weekend hack that lets you seek for a selected movie:

Listed here are the rough steps you’ll follow to understand a project like these:

  1. Load the files into documents
  2. Split long documents into chunks
  3. Generate and store the embeddings from the documents with an embeddings model
  4. Define the index query to retrieve the relevant files

Project Idea 7: Query Answering over Documents

may be viewed as a mix of search (see search) and summarization (see summarization). It may well help work through any document in a more intuitive way.

You should use it to chat together with your documents or any code base:

Listed here are the rough steps you’ll follow to understand this project:

  1. Load source code into documents
  2. Split long documents into chunks
  3. Generate and store the embeddings from the documents with an embeddings model
  4. Define the index query to retrieve context and prompt the LLM on it

Project Idea 8: Clustering Documents into Topics

Except for querying documents or information from said documents, it’s also possible to use embeddings to place documents into categories by utilizing (unsupervised learning).

For instance, you need to use clustering to search out topics in a podcast episode.

Or you possibly can cluster posts on a web-based forum into topics.

3 COMMENTS

LEAVE A REPLY

Please enter your comment!
Please enter your name here