Semi-strong efficient market of Bitcoin and Twitter: An analysis of semantic vector spaces of extracted keywords and light gradient boosting machine models

    Research output: Contribution to journalArticlepeer-review

    1 Scopus citations

    Abstract

    This study extends the examination of the Efficient-Market Hypothesis in Bitcoin market during a five-year fluctuation period, from September 1 2017 to September 1 2022, by analyzing 28,739,514 qualified tweets containing the targeted topic “Bitcoin”. Unlike previous studies, we extracted fundamental keywords as an informative proxy for carrying out the study of the EMH in the Bitcoin market rather than focusing on sentiment analysis, information volume, or price data. We tested market efficiency in hourly, 4-hourly, and daily time periods to understand the speed and accuracy of market reactions towards the information within different thresholds. A sequence of machine learning methods and textual analyses were used, including measurements of distances of semantic vector spaces of information, keywords extraction and encoding model, and Light Gradient Boosting Machine (LGBM) classifiers. Our results suggest that 78.06% (83.08%), 84.63% (87.77%), and 94.03% (94.60%) of hourly, 4-hourly, and daily bullish (bearish) market movements can be attributed to public information within organic tweets.

    Original languageBritish English
    Article number102692
    JournalInternational Review of Financial Analysis
    Volume88
    DOIs
    StatePublished - Jul 2023

    Keywords

    • Bitcoin
    • Efficient-market hypothesis
    • GloVe semantic vector spaces
    • LightGBM
    • Twitter

    Fingerprint

    Dive into the research topics of 'Semi-strong efficient market of Bitcoin and Twitter: An analysis of semantic vector spaces of extracted keywords and light gradient boosting machine models'. Together they form a unique fingerprint.

    Cite this