Natural Language Processing (NLP) revolutionized how we interact with technology.
Do you remember when chatbots first appeared and sounded like robots? Thankfully, that's in the past!
Transformer models have waved their magic wand and reshaped NLP tasks. But before you drop this post, bear with me. This won't be another technical article trying to teach you the math behind this amazing technology; instead, we will learn in practice what it can do for us.
With the Transformers Pipeline from Hugging Face, NLP tasks are easier than ever.
Let’s explore!
The Simplest Explanation of What a Transformer Is
Consider transformer models as the elite of the NLP world.
Transformers excel thanks to a mechanism called "self-attention," which allows them to decide which parts of an input sequence are the most important to focus on at any given time.
Ever heard of BERT, GPT, or RoBERTa? That's them! BERT (Bidirectional Encoder Representations from Transformers) is a revolutionary Google AI language model from 2018 that understands text context by reading words both left-to-right and right-to-left simultaneously.
Enough talk; let's dive into the transformers package [1].
Introduction to the Transformers Pipeline
The Transformers library offers a complete toolkit for training and running state-of-the-art pretrained models. The Pipeline class, which is our main subject, provides an easy-to-use interface for diverse tasks, e.g.:
- Text generation
- Image segmentation
- Speech recognition
- Document QA
Preparation
Before starting, let's run through the basics and gather our tools. We'll need Python, the transformers library, and either PyTorch or TensorFlow as a backend. Installation is business-as-usual: pip install transformers.
Distributions like Anaconda or platforms like Google Colab already include these out of the box. No trouble.
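If you are starting from a clean environment, a minimal setup (assuming the PyTorch backend) looks like this:

pip install transformers torch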
The Pipeline class allows you to execute many machine learning tasks using any model available on the Hugging Face Hub. It is as simple as plug and play.
While every task comes with a pre-configured default model and preprocessor, you can easily customize this by using the model parameter to swap in a different model of your choice, as sketched below.
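For instance, here is a minimal sketch that pins the sentiment task to an explicit Hub model (here, the same model the task would pick by default, as we will see in the pipeline's warning below) instead of relying on the default:

from transformers import pipeline

# Pin an explicit model instead of relying on the task default
classifier = pipeline(
    "sentiment-analysis",
    model="distilbert/distilbert-base-uncased-finetuned-sst-2-english",
)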
Code
Let's begin with Transformers 101 and see how it works before we go any deeper. The first task we will perform is a simple sentiment analysis on a given news headline.
from transformers import pipeline
classifier = pipeline("sentiment-analysis")
classifier("Instagram desires to limit hashtag spam.")
The response is the following.
No model was supplied, defaulted to distilbert/distilbert-base-uncased-finetuned-sst-2-english and revision 714eb0f (https://huggingface.co/distilbert/distilbert-base-uncased-finetuned-sst-2-english).
Using a pipeline without specifying a model name and revision in production is not recommended.
Device set to use cpu
[{'label': 'NEGATIVE', 'score': 0.988932728767395}]
Since we didn't supply a model parameter, it went with the default option. As a classification, we got that the sentiment of this headline is NEGATIVE, with 98% confidence. Moreover, we could pass a list of sentences to classify, not just one, as shown below.
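A quick sketch (the second headline is invented for illustration):

classifier([
    "Instagram wants to limit hashtag spam.",
    "New camera feature delights creators.",
])

The pipeline returns one label and score per sentence, in the same order.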
Super easy, right? But that's not all. Let's keep exploring other cool functionalities.
Zero-Shot Classification
Zero-shot classification means labelling a text that hasn't been labelled yet, so we don't have a clear pattern for it. All we need to do is pass a few candidate classes for the model to choose from. This can be very useful when creating training datasets for machine learning.
This time, we're feeding the method the model argument and a list of sentences to classify.
classifier = pipeline("zero-shot-classification", model = 'facebook/bart-large-mnli')
classifier(
["Inter Miami wins the MLS", "Match tonight betwee Chiefs vs. Patriots", "Michael Jordan plans to sell Charlotte Hornets"],
candidate_labels=["soccer", "football", "basketball"]
)
[{'sequence': 'Inter Miami wins the MLS',
'labels': ['soccer', 'football', 'basketball'],
'scores': [0.9162040948867798, 0.07244189083576202, 0.011354007758200169]},
{'sequence': 'Match tonight between Chiefs vs. Patriots',
'labels': ['football', 'basketball', 'soccer'],
'scores': [0.9281435608863831, 0.0391676239669323, 0.032688744366168976]},
{'sequence': 'Michael Jordan plans to sell Charlotte Hornets',
'labels': ['basketball', 'football', 'soccer'],
'scores': [0.9859175682067871, 0.009983371943235397, 0.004099058918654919]}]
It looks like the model did a great job labelling these sentences!
Text Generation
The package can also generate text. This is a great way of creating a nice little story generator for our kids before bedtime. We're increasing the temperature parameter to make the model more creative.
generator = pipeline("text-generation", temperature=0.8)
generator("Once upon a time, in a land where the King Pineapple was")
[{'generated_text':
"Once upon a time, in a land where the King Pineapple was a common
crop, the Queen of the North had lived in a small village. The Queen had always
lived in a small village, and her daughter, who was also the daughter of the Queen,
had lived in a larger village. The royal family would come to the Queen's village,
and then the Queen would return to her castle and live there with her daughters.
In the middle of the night, she would lay down on the royal bed and kiss the princess
at least once, and then she would return to her castle to live there with her men.
In the daytime, however, the Queen would be gone forever, and her mother would be alone.
The reason for this disappearance, in the form of the Great Northern Passage
and the Great Northern Passage, was the royal family had always wanted to take
the place of the Queen. In the end, they took the place of the Queen, and went
with their daughter to meet the King. At that time, the King was the only person
on the island who had ever heard of the Great Northern Passage, and his return was
in the past.
After Queen Elizabeth's death, the royal family went to the
Great Northern Passage, to seek out the Princess of England and put her there.
The Princess of England had been in"}]
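Generation can be tuned further. As a sketch (the parameter values here are arbitrary), we can cap the output length and ask for more than one candidate story:

generator(
    "Once upon a time, in a land where the King Pineapple was",
    max_length=100,          # cap the total output length, in tokens
    num_return_sequences=2,  # return two alternative continuations
)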
Named Entity Recognition
This task can recognize persons (PER), locations (LOC), or organizations (ORG) in a given text. That's great for creating quick marketing lists of lead names, for instance.
ner = pipeline("ner", grouped_entities=True)
ner("The person landed on the moon in 1969. Neil Armstrong was the primary man to step on the Moon's surface. He was a NASA Astronaut.")
[{'entity_group': 'PER', 'score': np.float32(0.99960065),'word': 'Neil Armstrong',
'start': 36, 'end': 50},
{'entity_group': 'LOC', 'score': np.float32(0.82190216), 'word': 'Moon',
'start': 84, 'end': 88},
{'entity_group': 'ORG', 'score': np.float32(0.9842771), 'word': 'NASA',
'start': 109, 'end': 113},
{'entity_group': 'MISC', 'score': np.float32(0.8394754), 'word': 'As',
'start': 114, 'end': 116}]
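Note the last entry: the stray MISC entity "As" is most likely just the first subword of "Astronaut" (characters 114-116 of the input), a common artifact of the subword tokenization these models use.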
Summarization
Possibly one of the most used tasks, summarization lets us reduce a text while keeping its essence and most important pieces. Let's summarize this Wikipedia passage about transformers.
summarizer = pipeline("summarization")
summarizer("""
In deep learning, the transformer is an artificial neural network architecture based
on the multi-head attention mechanism, in which text is converted to numerical
representations called tokens, and each token is converted into a vector via lookup
from a word embedding table.[1] At each layer, each token is then contextualized within the scope of the context window with other (unmasked) tokens via a parallel multi-head attention mechanism, allowing the signal for key tokens to be amplified and less important tokens to be diminished.
Transformers have the advantage of having no recurrent units, therefore requiring
less training time than earlier recurrent neural architectures (RNNs) such as long
short-term memory (LSTM).[2] Later variations have been widely adopted for training
large language models (LLMs) on large (language) datasets.[3]
""")
[{'summary_text':
' In deep learning, the transformer is an artificial neural network architecture
based on the multi-head attention mechanism . Transformerers have the advantage of
having no recurrent units, therefore requiring less training time than earlier
recurrent neural architectures (RNNs) such as long short-term memory (LSTM)'}]
Excellent!
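If you need tighter control over the output size, the call also accepts generation parameters such as max_length and min_length. A sketch, with arbitrary values and long_text standing in for any text you want to condense:

summary = summarizer(long_text, max_length=60, min_length=20)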
Image Recognition
There are other, more complex tasks, such as image recognition, and they are just as easy to use as the previous ones.
image_classifier = pipeline(
    task="image-classification", model="google/vit-base-patch16-224"
)
result = image_classifier(
    "https://images.unsplash.com/photo-1689009480504-6420452a7e8e?q=80&w=687&auto=format&fit=crop&ixlib=rb-4.1.0&ixid=M3wxMjA3fDB8MHxwaG90by1wYWdlfHx8fGVufDB8fHx8fA%3D%3D"
)
print(result)
[{'label': 'Yorkshire terrier', 'score': 0.9792122840881348},
{'label': 'Australian terrier', 'score': 0.00648861238732934},
{'label': 'silky terrier, Sydney silky', 'score': 0.00571345305070281},
{'label': 'Norfolk terrier', 'score': 0.0013639888493344188},
{'label': 'Norwich terrier', 'score': 0.0010306559270247817}]
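By default, the pipeline returns the five most likely labels. Passing top_k narrows the list; a sketch keeping only the best guess:

# Keep only the single best label (top_k defaults to 5)
image_classifier(
    "https://images.unsplash.com/photo-1689009480504-6420452a7e8e?q=80&w=687&auto=format&fit=crop&ixlib=rb-4.1.0&ixid=M3wxMjA3fDB8MHxwaG90by1wYWdlfHx8fGVufDB8fHx8fA%3D%3D",
    top_k=1,
)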
So, with these few examples, it is easy to see how simple it is to use the Transformers library to perform different tasks with very little code.
Wrapping Up
What if we wrap up by applying our knowledge to a small, practical project?
Let's create a simple Streamlit app that reads a resumé, returns its sentiment analysis, and classifies the tone of the text as one of ["Senior", "Junior", "Trainee", "Blue-collar", "White-collar", "Self-employed"].
In the following code, we:
- Import the packages
- Create Title and subtitle of the page
- Add a text input area
- Tokenize the text and split it into chunks for the transformer task. See the list of models [4].
import streamlit as st
from transformers import pipeline
from transformers import AutoTokenizer

st.title("Resumé Sentiment Analysis")
st.caption("Checking the sentiment and language tone of your resume")

# Add input text area
text = st.text_area("Enter your resume text here")

# 1. Load the desired tokenizer
model_checkpoint = "bert-base-uncased"
tokenizer = AutoTokenizer.from_pretrained(model_checkpoint)

# 2. Tokenize the text without padding, truncation, or special tokens,
#    returning a plain list of token ids we can slice manually
tokens = tokenizer(text, add_special_tokens=False)["input_ids"]

# 3. Split into chunks of 500 tokens with an overlap of 100 tokens
#    so that context is not lost between chunks
chunk_size, overlap = 500, 100
chunks = [tokens[i:i + chunk_size] for i in range(0, len(tokens), chunk_size - overlap)]

# 4. Convert each chunk back to a string the model likes,
#    re-adding the special tokens ([CLS] and [SEP])
decoded_chunks = []
for chunk in chunks:
    final_input = tokenizer.prepare_for_model(chunk, add_special_tokens=True)
    decoded_chunks.append(tokenizer.decode(final_input["input_ids"]))

st.write(f"Created {len(decoded_chunks)} chunks.")
Next, we will initialize the transformers pipelines to:
- Perform the sentiment analysis and return the confidence %.
- Classify the text tone and return the confidence %.
# Initialize the sentiment analysis pipeline
sentiment_pipeline = pipeline("sentiment-analysis")

# Perform sentiment analysis and tone classification
if st.button("Analyze") and decoded_chunks:
    col1, col2 = st.columns(2)
    with col1:
        # Sentiment analysis (we report the result for the first chunk)
        sentiment = sentiment_pipeline(decoded_chunks)[0]
        st.write(f"Sentiment: {sentiment['label']}")
        st.write(f"Confidence: {100*sentiment['score']:.1f}%")
    with col2:
        # Categorize tone; candidate_labels is passed at call time
        tone_pipeline = pipeline("zero-shot-classification", model="facebook/bart-large-mnli")
        tone = tone_pipeline(
            decoded_chunks,
            candidate_labels=["Senior", "Junior", "Trainee", "Blue-collar", "White-collar", "Self-employed"],
        )[0]
        st.write(f"Tone: {tone['labels'][0]}")
        st.write(f"Confidence: {100*tone['scores'][0]:.1f}%")
Here’s the screenshot.

Before You Go
Hugging Face (HF) Transformers Pipelines are truly a game-changer for data practitioners. They provide an incredibly streamlined way to tackle complex machine learning tasks, like text generation or image segmentation, using just a few lines of code.
HF has already done the heavy lifting by wrapping sophisticated model logic into simple, intuitive methods.
This shifts the focus away from low-level coding and allows us to concentrate on what really matters: using our creativity to build impactful, real-world applications.
If you liked this content, you can find more about me on my website.
GitHub Repository
https://github.com/gurezende/Resume-Sentiment-Evaluation
References
[1] Transformers package: https://huggingface.co/docs/transformers/index
[2] Transformers Pipelines: https://huggingface.co/docs/transformers/pipeline_tutorial
[3] Pipelines Examples: https://huggingface.co/learn/llm-course/chapter1/3#summarization
[4] HF Models: https://huggingface.co/models
