Uneingeschränkter Zugang

Users’ Sentiment Analysis of Shopping Websites Based on Online Reviews


Zitieren

Introduction

The rapid development of e-commerce has deeply influenced the behavior pattern of consumers, and online shopping has become the normal way of shopping for web users currently. Because of the inability to feel the physical goods, consumers know the quality and performance characteristics of goods by IWOM (Internet Word-of-Mouth) before shopping. The shopping experience of other consumers will have an important impact on their purchasing decisions. Online reviews, as an important form of IWOM, evaluate goods in the form of text and are believed to be more trustworthy than advertising and expert recommendations [1, 2].

Users’ sentimental orientation of online reviews can have a greater impact on the consumption psychology and purchasing decisions of potential users [3]. Online reviews include a large amount of information about the description and use feeling of commodity attributes, as well as consumers’ feedback on commodity or service. How to analyze the sentimental orientation of these rich and complex online reviews and mine the hidden rule, in order to better provide decision-making basis for consumers, enterprises and shopping platform. It has become a hot research topic in marketing and the popular research direction in the field of text mining [4].

Based on the sentiment analysis technology, this paper directly explores the consumers’ sentimental orientation to the attributes of commodity or service from comments text, analyzes the user satisfaction and preference, and then discusses how the consumers, commodity producers and online shopping platforms to manage and utilize online reviews effectively.

Literature Review

Sentiment analysis, also known as opinion mining, refers to the process of semantic analysis of online reviews using text mining technique to identify the consumers’ attitudes and opinions of goods and services, to excavate the distribution of consumer's sentimental orientation [5, 6]. Sentiment analysis on online reviews involves some technologies, such as natural language processing, machine learning and text mining. The research mainly focuses on the analysis of sentimental polarity and strength, value mining and so on.

Analysis of sentimental polarity and intensity is mainly through discussing the text content of online reviews to identify consumers’ sentimental orientation or polarity and determine the polarity intensity. According to the different granularity of text processing, the analysis of sentiment polarity is mainly divided into the fine-grained analysis and the coarse-grained analysis. The former is the analysis on sentimental polarity and intensity of feature-viewpoint based on corpus and dictionary. And the latter is the judgment of overall sentimental orientation or polarity of text based on model and machine learning. The existing researches are mostly about the analysis of coarse-grained sentimental polarity [7].

Valuable online reviews can help consumers improve their decision-making efficiency and provide additional benefits for businesses. The research on value mining of online reviews mainly concentrates upon its usefulness, impact on product sales and purchasing intention. The usefulness of online reviews is a subjective perception of whether online reviews are helpful to consumers’ purchasing decisions. There may be differences in the proportion of positive sentiment and negative sentiment of different reviews and its degree of concern which leads to differences in the usefulness of reviews, and the reviews with strengths and weaknesses of product will be more useful [8,9,10]. The factors that can significantly affect the sales volume of online products are mainly the number of online reviews, product attention, timeliness of reviews, usefulness of reviews and attribute inconsistency, and the polarized reviews are more powerful than neutral ones [11, 12]. The influence research of online reviews on consumer's purchasing intention is mainly related to the influence factors, influence mechanism and influence power [13,14,15].

Online reviews often involve the commodity attributes of quality, performance, function, price, logistics and service attitude, as well as their experience and feelings of using commodity or service [16]. The emotional expression of users is also complex with affirming certain aspects and criticizing other aspects of commodity or service. To better exploit the user's attitude towards different attributes of commodity or service and deeply study its sentimental orientation, on the basis of previous research achievements, this paper attempts to use natural language processing and ontology construction technology to extract the main features of online reviews, and determine the relationship between attributes of commodity or service and its emotional words. By designing the quantitative calculation rules of sentimental polarity, the user's fine-grained sentimental orientation about the attributes of commodity or service is analyzed deeply.

This paper explains how to perform the users’ sentiment analysis of shopping websites based on online reviews. First of all, it describes the research background and literature review on sentimental polarity analysis of online reviews and its value mining; next it explains the research process of sentimental polarity analysis and how to collect and process data, preprocess text content, extract the main attributes of commodity or service, and identify the relationship between attributes and emotional words; and then it provides the reference for consumers, commodity producers and online shopping platforms according to analysis results; at the end of the full text, it comes up with the foresight and future work.

Research Design

The design process of study is shown in Figure 1, including major steps such as data collection, text content preprocessing, attribute extraction and identification of corresponding relation, classification and assignment of sentimental orientation, calculation of sentimental polarity intensity.

Fig. 1

The research process of sentiment analysis

Data Collection and Text Content Preprocessing

Data acquisition software is applied to collect the online information of relevant commodity and clean obtained data, and text mining system is used for text processing and data mining. The words playing a key role in Chinese sentiment analysis, such as nouns, adjectives, adverbs and verbs, should be highlighted.

Online reviews usually involve multiple attributes or services of a goods and comment on these attributes separately. The sentence with an emotional word or an evaluative word usually expresses the user's opinion about a particular attribute of a commodity or service. It is necessary to carry out word segmentation, speech tagging and sentence processing for each online review. Then the content is preprocessed, the synonyms are processed, and the low-frequency words without meaning and the high-frequency words without obvious meaning are removed.

Attribute Extraction of Commodity or Service

Attributes extraction of goods and services refers to the extraction of users’ concerns on a product in online reviews, aiming to understand the users’ attention to functions, performance and services of product. Attributes extraction is the basis of sentiment analysis of online reviews. Extraction method, mainly relying on the statistical theory, is to identify the product attributes based on predefined rules to count the frequency of nouns appearing in a specific domain, some attributes of commodity or service are extracted as the reviews objects according to the order of occurrence frequency and then users’ sentiment analysis is carried out.

Identification of Relationship between Attributes and Emotional Words of Commodity or Service

Emotional words refer to those words in online reviews that can more clearly express users’ position, attitude and feelings towards the attributes of commodity or services. By means of text analysis, such as speech tagging, the corresponding relationship between commodity attribute words and emotional words is judged according to its co-occurrence. Firstly, attribute words are used as the reference to find its emotional word in order backward, and then emotional words are used as the reference to find its degree adverbs in order forward.

Classification and Assignment of Sentimental Orientation

Use the positive words or negative words in comments to determine the user's sentimental orientation or sentimental polarity to an attribute of commodity or service. When making the judgment of sentimental orientation, it is necessary to combine the same attributes of commodity or service with the same meaning, and classify the sentimental orientation of these attributes into the same category.

The sentimental intensity expressed by words with the same sentimental polarity may be different. For example, “good-looking” and “amazing” are the positive sentimental polarity, but the polarity intensity of the latter is obviously greater than that of the former. The degree adverb can also change the sentimental intensity of its modifier, such as the sentimental polarity intensity of “very satisfied” is significantly greater than that of “satisfied”. In order to understand the differences in users’ sentimental orientation towards the attributes of commodity or service, it is necessary to quantify the polarity intensity of emotional words and emotional words with degree adverbs. It is divided into five different grades according to the intensity of positive emotional words and negative emotional words, as shown in Table 1.

The quantitative rule for the intensity of sentimental polarity

Sentimental polarityNegative words and degree adverbsSentimental polarity intensity
Positive emotionsextremely, very, most2
relatively, slightly1
Neutral emotionsstill, right, just right0
Negative emotionsnot good, insufficient, no−1
very, too−2
Empirical Analysis

Tmall.com, JD.com and Suning.com, accounting for more than 80% of market scale of B2C online shopping in China, are selected as the samples. These B2C shopping websites have great influence and its own characteristics, such as the marketing of Tmall.com, the supply chain of JD.com, and the offline physical stores of Suning.com. Online reviews information of popular goods is taken as the samples data to study the users’ sentiment analysis of B2C shopping websites.

Huawei, leading manufacturer of mobile phones in China, has a large fan base which is good for data collection. Online reviews about Honor 8 mobile phones are abundant in shopping websites and it can well support the research of this paper.

Data Collection and Processing

Octopus information acquisition software is used to capture the comments of mobile phone in Tmall.com, JD.com and Suning.com. The time of data collection is from the release date (listed on July 11, 2016) to January 10, 2017. After deleting repeated comments from the same user, a total of 14502 online reviews about Honor 8 are obtained.

Text mining system developed by our project team is adopted to segment words, calculate the word frequency and sort by ascending order. It can be fully prepared for extracting the attributes words through text content preprocessing and updating the dictionary of disused words and synonym.

Some words (or strings) that have nothing to do with this study are added to the dictionary of disused words, such as kkkkk, x6, 58, 12, hglll and other meaningless low-frequency words, and mobile phone, Tmall, this, huawei, today and other high-frequency words without obvious meaning. Some words with the same meaning are added to the dictionary of synonym, such words as photographs, cameras, self-portraits, photographing and other words are defined as “photograph” in the dictionary.

Attribute Extraction of Commodity or Service

The high-frequency words in online reviews of shopping websites can reflect the users’ concerns about goods and services. In general, the higher the word frequency of a commodity attribute or service attribute, the more the user is concerned about the property or service of the commodity. In this paper, the high-frequency words that line in the top 100 in shopping website (Tmall.com, JD.com and Suning.com) are selected. After removing the words with less obvious context, there are 30 high-frequency words about the attributes of commodity or services. Based on reviewing the related literature and combining with the characteristics of mobile product, 30 high-frequency words are divided into 7 categories in this paper, namely appearance, function, performance, configuration, price, quality and service, as shown in Table 2.

The attributes classification of commodity or services

Theme classificationHigh-frequency words for the attributes of commodity and services
appearanceentirety, appearance, color, cute face
functionfunction, photograph, playing games, video, fingerprint, unlocking
performanceperformance, operation, signal, charge, reaction
configurationmemory, system, screen, battery, membrane
priceprice, cost performance
qualityquality, sound quality, hand feeling
servicelogistics, customer service, express, deliver goods, delivery

The proportion of online reviews on the attributes of Honor 8 is shown in Table 3, from which the attributes of commodity or services that users are concerned about can be learned. Users are most interested in the hand feeling attribute with the largest number of online reviews (9.69%). There are more online reviews on the attributes of commodity or services including screen, appearance, logistics, photograph, customer service, operation, express, charge, system, battery, delivery goods, and their proportion of reviews are more than 5%. Among them, the attributes such as memory, system, screen and battery belong to the configuration theme, and the attributes such as logistics, customer service, express and deliver goods are classified as service theme. It shows that users usually pay more attention to the configuration and after-sales service of mobile phones. While users pay less attention to the commodity attributes of price, sound quality, color, entirety, delivery, and the number of reviews accounts for less than 2%.

The proportion and mean value of sentimental polarity intensity of attributes

Commodity attributeProportion (%)Mean of intensity
hand feeling9.691.49
screen8.590.87
appearance8.381.38
logistics7.901.46
photograph7.231.08
customer service6.871.24
operation6.801.16
express6.261.38
charge5.860.82
system5.390.80
battery5.270.19
deliver goods5.121.13
fingerprint4.991.41
cute face4.821.44
function4.610.98
reaction3.991.12
unlocking cost3.351.51
performance3.351.25
signal3.120.83
membrane3.05−0.46
quality2.941.12
memory2.830.11
performance2.441.16
playing games2.360.68
video2.270.95
price1.880.61
sound quality1.850.95
color1.681.13
entirety1.390.98
delivery1.121.16
Attribute Analysis of Commodity or Service

According to quantitative rules of sentimental polarity intensity designed in Table 1, the value of sentimental polarity intensity and its mean value (N), the same attribute of commodity or services corresponding to emotional words, is calculated one by one. The mean value of sentimental polarity intensity of online reviews of Honor 8 is shown in Table 3. Most of the attributes of commodity or services show a positive sentimental orientation, i.e., N ≥ 0.3. There are two attributes show a neutral sentimental orientation, which is 0 ≤ N < 0.3. Only one attribute indicates a negative sentimental orientation with N < 0. The value distribution of sentimental polarity intensity of the main commodity attributes is shown in Table 4.

The distribution of sentimental polarity intensity of the main commodity attributes

Attribute of commodity or serviceDistribution of sentimental polarity intensity of the main commodity attributes (%)
−2−1012
unlocking0.624.122.4729.0163.79
hand feeling00.572.8543.7752.81
logistics3.254.210.4427.3064.79
cute face0.290.141.7250.7947.06
fingerprint1.365.904.0827.3861.27
appearance00.336.9446.9945.75
express2.115.903.0030.1458.84
playing games5.4615.0211.9541.3026.28
price1.8513.3318.1554.8111.85
battery5.4326.3622.7834.5710.86
memory7.7124.3825.6233.338.96
membrane2.3550.8239.764.242.82

As shown in Table 4, users are most satisfied with unlocking function, hand feeling attribute and logistics service of Honor 8, and the mean value of their sentimental polarity intensity is close to 1.5 (as shown in Table 3). As for the unlocking function of mobile phone, its proportion of positive sentimental polarity is 92.8%, among which 63.79% of users report that the unlocking function is very fast, convenient and user-friendly. The positive sentimental polarity of hand feeling attribute is 96.58%, the number of users who feel very good, excellent and first-class account for 52.81%. The proportion of positive sentimental polarity of logistics service is 92.09%, among which 64.79% of users report that logistics is very fast, very awesome and so on.

Users are also satisfied with the attributes of cute face, fingerprint, appearance, express, cost performance, customer service, performance, operation, deliver goods, delivery, reaction, quality, photograph, function, entirety, sound quality, screen, signal, charge and system, and their mean value of sentimental polarity intensity is between 0.7 and 1.45, and their proportion of positive sentimental polarity is relatively high.

Users are satisfied with the attributes of playing games and price, and their mean value of sentimental polarity intensity is between 0.3 and 0.7. While the polarity intensity of battery and memory are relatively small, between 0 and 0.3, and their negative sentimental polarity is relatively high with 31.79% and 32.09% respectively. This shows that users believe the battery and memory configuration of mobile phone is general. The user's negative feedback on battery mainly focuses on less strong battery endurance, less use time, battery breakthrough and other aspects. The negative emotional comments on memory configuration are mainly concentrated in the system with more memory, less available memory and the inability to install memory card.

The mean value of sentimental polarity intensity of membrane attribute is negative, accounting for more than half of negative sentimental polarity of 53.17%. This indicates that most users are dissatisfied with the membrane of mobile phone. The problem of membrane feedback is mainly manifested in the lack of membrane delivery, poor appearance after film coating, and poor membrane matching of mobile phone.

Discussion and Conclusion

Based on users’ sentiment analysis of shopping websites, the following conclusions are drawn. (1) Web users usually pay more attention to the configuration of HUAWEI mobile, such as the attributes of memory, system, screen, battery, and the after-sales service of mobile, such as the attributes of logistics, customer service, express and deliver goods. (2) Users have a positive sentimental orientation to most of attributes, among which the most satisfying ones are unlocking function, hand feeling attribute and logistics service. (3) Users show a neutral sentimental orientation towards the attributes of battery and memory, and a negative sentimental orientation towards the membrane of mobile phone. The results can provide a reference for consumers to make purchasing decisions, enterprises to improve product quality, and shopping platform to optimize service direction.

For consumers, the rapidly increasing number of online reviews provides consumers with more comprehensive information, and also brings information overload which affects consumers’ judgment of online reviews. The consumers’ attitude can be automatically recognized in the huge amount of online reviews through users’ sentiment analysis on online reviews. It can help consumers to understand the distribution of commentators’ position, attitude or sentimental orientation towards the attributes of commodity or service, so as to make the right purchasing decisions. Those users of shopping websites are satisfied with most of attributes, such as hand feeling, appearance, express, customer service, performance, operation, delivery, reaction, quality, photograph, screen, charge and system. Most users think the phone's battery and memory configuration are general, and are dissatisfied with membrane of phone. Consumers have different expectations for quality, function and performance of products due to their different concerns and preferences. Consumers can pay attention to the attributes of commodity or service that they are interested in, so as to provide the decision reference for purchasing goods.

For commodity producers, users’ sentiment analysis on online reviews can reflect consumers’ concerns or preferences about products, for example, the battery endurance is not too strong and the use time is not too long; the system takes up more memory and has less available memory, the actual memory configuration does not accord with propaganda; sometimes playing games or playing video will not flow smoothly; The sound is harsh when the volume is high. The feedback information comes directly from consumers, reflecting their real idea and representing their actual needs, and should be highly valued by enterprises. Enterprises should conform to the objective reality in product promotion and solve the consumer's problem and dissatisfaction in time according to consumer feedback, in order to continuously improve product quality, service and performance. Commodity producers should formulate product marketing and development strategies in line with consumer needs to improve the image of enterprises and brands so as to win the advantages of market competitive.

For shopping platforms, users’ sentiment analysis on online reviews can report the consumers’ concern or feeling preference about the services provided by shopping platform. For example, the logistics speed is slow, the attitude of express service is not good, the customer service language is too stiff or impersonal, and the delivery is not timely. The feedback information represents the true feelings of consumers, and shopping websites can analyze its own problems and self-examination to continuously improve its service level. They can optimize delivery process, improve logistics speed and service quality, and provides users with better service experience through the integration of online and offline resources.

Although several valuable conclusions have been drawn from the research, there are some limitations: the research data of samples is limited and the characteristics of users’ sentimental orientation of different shopping websites can also be further analyzed. In the future research, more data and samples will be used to conduct a comparative study on users’ sentiment analysis of different shopping websites based on online reviews.

eISSN:
2444-8656
Sprache:
Englisch
Zeitrahmen der Veröffentlichung:
Volume Open
Fachgebiete der Zeitschrift:
Biologie, andere, Mathematik, Angewandte Mathematik, Allgemeines, Physik