The sentiment analysis is performed using the "nlptown/bert-base-multilingual-uncased-sentiment" model from Hugging Face. This model is trained on product reviews in multiple languages and utilizes the BERT architecture.

As per the information on the Hugging Face model page, the accuracy of this model for sentiment analysis on English text is approximately 95%. According to our previous experiments on manually annotated Welsh reviews, the accuracy for Welsh is approximately 73%.

How do you want to categorise the sentiments?

3 Class Sentiments (Positive, Neutral, Negative)

5 Class Sentiments (Very Positive, Positive, Neutral, Negative, Very Negative)

3 Class Aspect-Based Sentiment Analysis (Positive, Neutral, Negative)

The figure displays the sentiment analysis of the data, you can press on any part of the graph to display the data

Scatter Plot Overview

In a scatter plot, the single words with the highest sentiment association are displayed. The x- and y-axes show their usage in positive vs. negative and neutral sentiments, respectively.

Colour Coding:

Blue: Positive Words

Red: Negative Words

Yellow: (Positive and Negative)

Orange: (Positive and Negative)

Towards the top-right, the most frequently shared terms between the two sentiments are found, while the bottom-left has the least frequent shared terms.

Score Range:

The range is between -1 and 1, with scores near 0 representing words with similar frequencies in both classes (yellow and orange dots). Scores near 1 are for words more frequent in positive contexts (blue), and scores near -1 for negative contexts (red). Darker shades of blue or red indicate scores closer to their respective extremes.

Interactive Features:

Hovering over the dots on the plot reveals word frequency statistics per 25,000 words for both classes and a Scaled F-Score. This frequency determines each point's plot position. For instance, a given metric might be 195:71 per 25k words. Using the query box or clicking on a dot provides more details, like the frequency per 1,000 Reddit posts ('doc').

Exclude stopwords (Regenerates the plot)

Word Cloud generator

Select cloud type:

Select cloud shape:

Select cloud outline colour:

Select cloud measurement:

Loading word cloud...

Select a word for analysis

Select Category

Word

POS Tag

Semantic Tag

Select Option

Include all data (stopwords, misfiltered words, etc.)

Left Context	Keyword	Right Context

Search Word:

Show punctuation menu

This word tree shows the words which commonly occur before and after your searched word. The bigger the font size, the more often the word occurs. The number of times each word occurs is also shown when you scroll over the word (the number after ‘weight’)

FreeTxt

How would you like to start your analysis?

Select columns

How do you want to categorise the sentiments?

Enter aspects to analyse:

Scatter Plot Overview

Colour Coding:

Score Range:

Interactive Features:

This tool, adapted from the Welsh Summarisation project, produces a basic extractive summary of the review text from the selected columns.

Word Cloud generator

Loading word cloud...

Word & Tag Associations

Select a word for analysis

This word tree shows the words which commonly occur before and after your searched word. The bigger the font size, the more often the word occurs. The number of times each word occurs is also shown when you scroll over the word (the number after ‘weight’)

The word frequency is represented by the weight in the tool tip

Include Keyword Analysis

Include Sentiment Pie Chart

Include Sentiment Bar chart

Include Word Tree Data

Include Summary

Include Word Cloud