Options
MAEC: A Multimodal Aligned Earnings Conference Call Dataset for Financial Risk Prediction
Author(s)
Date Issued
2020-10-23
Date Available
2021-05-27T12:04:15Z
Abstract
In the area of natural language processing, various financial datasets have informed recent research and analysis including financial news, financial reports, social media, and audio data from earnings calls. We introduce a new, large-scale multi-modal, text-audio paired, earnings-call dataset named MAEC, based on S&P 1500 companies. We describe the main features of MAEC, how it was collected and assembled, paying particular attention to the text-audio alignment process used. We present the approach used in this work as providing a suitable framework for processing similar forms of data in the future. The resulting dataset is more than six times larger than those currently available to the research community and we discuss its potential in terms of current and future research challenges and opportunities. All resources of this work are available at https://github.com/Earnings-Call-Dataset/
Sponsorship
Science Foundation Ireland
Other Sponsorship
Insight Research Centre
Type of Material
Conference Publication
Publisher
ACM
Copyright (Published Version)
2020 ACM
Web versions
Language
English
Status of Item
Peer reviewed
Journal
CIKM '20: Proceedings of the 29th ACM International Conference on Information & Knowledge Management
Conference Details
The 29th ACM International Conference on Information and Knowledge Management, Virtual Event, (CIKM '20), 19-23 October 2020
ISBN
978-1-4503-6859-9
This item is made available under a Creative Commons License
File(s)
Loading...
Name
A Multimodal Aligned Earnings Conference Call Dataset for Financial Risk Prediction.pdf
Size
637.73 KB
Format
Adobe PDF
Checksum (MD5)
d4e4a84f495f30fd784fd928fc685161
Owning collection