TY - GEN
T1 - Cross-Course and Multi-course Sentiment Classification of Student Posts
AU - Dolianiti, Foteini
AU - Iakovakis, Dimitrios
AU - Dias, Sofia B.
AU - Hadjileontiadou, Sofia
AU - Diniz, José A.
AU - Natsiou, Georgia
AU - Tsitouridou, Melpomeni
AU - Hadjileontiadis, Leontios
N1 - Publisher Copyright:
© 2020, Springer Nature Switzerland AG.
PY - 2020
Y1 - 2020
N2 - Affective Computing is one of the most active research topics in education. Increased interest in emotion recognition through text channels makes sentiment analysis (i.e., the Natural Language Processing task of determining the valence in texts) a state-of-the-practice tool. Considering the domain-dependent nature of sentiment analysis as well as the heterogeneity of the educational domain, development of robust sentiment classifiers requires an in-depth understanding of the effect of the teaching-learning context on model performance. This work investigates machine learning-based sentiment classification on datasets comprised of student posts in forums, pertaining to two different academic courses. Different dataset configurations were tested, aiming to compare performance: i) between single-course and multi-course classifiers, ii) between in-course and cross-course classification. A sentiment classifier was built for each course, exhibiting a fair performance. However, classification performance dramatically decreased, when the two models were transferred between courses. Additionally, classifiers trained on a mixture of courses underperformed single-course classifiers. Findings suggested that sentiment analysis is a course-dependent task and, as a rule of thumb, less but course-specific information results in more effective models than more but non-specialized information.
AB - Affective Computing is one of the most active research topics in education. Increased interest in emotion recognition through text channels makes sentiment analysis (i.e., the Natural Language Processing task of determining the valence in texts) a state-of-the-practice tool. Considering the domain-dependent nature of sentiment analysis as well as the heterogeneity of the educational domain, development of robust sentiment classifiers requires an in-depth understanding of the effect of the teaching-learning context on model performance. This work investigates machine learning-based sentiment classification on datasets comprised of student posts in forums, pertaining to two different academic courses. Different dataset configurations were tested, aiming to compare performance: i) between single-course and multi-course classifiers, ii) between in-course and cross-course classification. A sentiment classifier was built for each course, exhibiting a fair performance. However, classification performance dramatically decreased, when the two models were transferred between courses. Additionally, classifiers trained on a mixture of courses underperformed single-course classifiers. Findings suggested that sentiment analysis is a course-dependent task and, as a rule of thumb, less but course-specific information results in more effective models than more but non-specialized information.
KW - Education
KW - Natural language processing
KW - Sentiment analysis
UR - http://www.scopus.com/inward/record.url?scp=85092664038&partnerID=8YFLogxK
U2 - 10.1007/978-3-030-60735-7_6
DO - 10.1007/978-3-030-60735-7_6
M3 - Conference contribution
AN - SCOPUS:85092664038
SN - 9783030607340
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 55
EP - 65
BT - Brain Function Assessment in Learning - 2nd International Conference, BFAL 2020, Proceedings
A2 - Frasson, Claude
A2 - Bamidis, Panagiotis
A2 - Vlamos, Panagiotis
PB - Springer Science and Business Media Deutschland GmbH
T2 - 2nd International Conference on Brain Function Assessment in Learning, BFAL 2020
Y2 - 9 October 2020 through 11 October 2020
ER -