2019, 浙大AZFT
Deep neural networks have achieved promising results in stock trend prediction.
However, most of these models have two common drawbacks, including (i) current methods are not sensitive enough to abrupt changes of stock trend, and (ii) forecasting results are not interpretable for humans.
To address these two problems, we propose a novel Knowledge-Driven Temporal Convolutional Network (KDTCN) for stock trend prediction and explanation.
Firstly, we extract structured events from financial news, and utilize external knowledge from knowledge graph to obtain event embeddings. Then, we combine event embeddings and price values together to forecast stock trend.
We evaluate the prediction accuracy to show how knowledge-driven events work on abrupt changes. We also visualize the effect of events and linkage among events based on knowledge graph, to explain why knowledge-driven events are common sources of abrupt changes. Experiments demonstrate that KDTCN can (i) react to abrupt changes much faster and outperform state-of-the-art methods on stock datasets, as well as (ii) facilitate the explanation of prediction particularly with abrupt changes.
- 这种说法,abrupt changes of stock trend
- event embedding怎么做的: event extraction + knowledge graph
- 好像效果还不错?整个TCN的效果看起来精度不错
2016, 哈工大SCIR
Representing structured events as vectors in continuous space offers a new way for defining dense features for natural language processing (NLP) applications. Prior work has proposed effective methods to learn event representations that can capture syntactic and semantic information over text corpus, demonstrating their effectiveness for downstream tasks such as event-driven stock prediction.
On the other hand, events extracted from raw texts do not contain background knowledge on entities and relations that they are mentioned.
To address this issue, this paper proposes to leverage extra information from knowledge graph, which provides ground truth such as attributes and properties of entities and encodes valuable relations between entities.
Specifically, we propose a joint model to combine knowledge graph information into the objective function of an event embedding learning model.
Experiments on event similarity and stock market prediction show that our model is more capable of obtaining better event embeddings and making more accurate prediction on stock market volatility.
- 缺乏领域知识的说法,它使用KG来引入的?
- event embedding怎么做的
- 怎么精度效果也不错?
- 哈工大SCIR
Learning Target-Specific Representations of Financial News Documents For Cumulative Abnormal Return Prediction
2018, 哈工大SCIR
Texts from the Internet serve as important data sources for financial market modeling. Early statistical approaches rely on manually defined features to capture lexical, sentiment and event information, which suffers from feature sparsity. Recent work has considered learning dense representations for news titles and abstracts.
Compared to news titles, full documents can contain more potentially helpful information, but also noise compared to events and sentences, which has been less investigated in previous work.
To fill this gap, we propose a novel target-specific abstract-guided news document representation model.
The model uses a target-sensitive representation of the news abstract to weigh sentences in the news content, so as to select and combine the most informative sentences for market modeling.
Results show that document representations can give better performance for estimating cumulative abnormal returns of companies when compared to titles and abstracts. Our model is especially effective when it used to combine information from multiple document sources compared to the sentence-level baselines.
- 手动构建的特征有feature sparsity的说法,而全文有噪声
- event embedding怎么做的,他用新闻摘要辅助从全文中抽取
2018, 微软AI for Finance
Stock trend prediction plays a critical role in seeking maximized profit from the stock investment. However, precise trend prediction is very difficult since the highly volatile and non-stationary nature of the stock market. Exploding information on the Internet together with the advancing development of natural language processing and text mining techniques have enabled investors to unveil market trends and volatility from online content.
Unfortunately, the quality, trustworthiness, and comprehensiveness of online content related to stock market vary drastically, and a large portion consists of the low-quality news, comments, or even rumors.
To address this challenge, we imitate the learning process of human beings facing such chaotic online news, driven by three principles: sequential content dependency, diverse influence, and effective and efficient learning.
In this paper, to capture the first two principles, we designed a Hybrid Attention Networks (HAN) to predict the stock trend based on the sequence of recent related news. Moreover, we apply the self-paced learning mechanism to imitate the third principle.
Extensive experiments on real-world stock market data demonstrate the effectiveness of our framework. A further simulation illustrates that a straightforward trading strategy based on our proposed framework can significantly increase the annualized return.
- 新闻、评论等低质量,他们模仿人类的三个学习准则
- 什么是self-paced learning mechanism?
- 这是怎么衡量准确率的?
2018, 北邮
It has been shown that stock price movements are influenced by news. To predict stock movements with news, many existing works rely only on the news title since the news content may contain irrelevancies which seriously degrade the prediction accuracy.
However, we observe that there is still useful information in the content which is not reflected in the title, and simply ignoring the content will result in poor performance.
In this paper, taking advantage of neural representation learning, we propose a hierarchical complementary attention network (HCAN) to capture valuable complementary information in news title and content for stock movement prediction.
In HCAN, we adopt a two-level attention mechanism to quantify the importances of the words and sentences in a given news. Moreover, we design a novel measurement for calculating the attention weights to avoid capturing redundant information in the news title and content.
Experimental results on news datasets show that our proposed model outperforms the state-of-the-art techniques.
- 他说的many existing works rely only on the news title具体指哪些?现在还有参考价值吗?现在用全文还是标题多?
- 计算注意力权重的新方法?
- 哎,二分类精度也有60%?
2013, 港城大
This paper proposes a technique to leverage topic based sentiments from Twitter to help predict the stock market.
We first utilize a continuous Dirichlet Process Mixture model to learn the daily topic set. Then, for each topic we derive its sentiment according to its opinion words distribution to build a sentiment time series. We then regress the stock index and the Twitter sentiment time series to predict the market.
Experiments on real-life S&P100 Index show that our approach is effective and performs better than existing state-of-the-art non-topic based methods.
- 就这二分类都有60%的准确率???
2018, 华师大
In addition to only considering stocks’ price series, utilizing short and instant texts from social medias like Twitter has potential to yield better stock market prediction.
While some previous approaches have explored this direction, their results are still far from satisfactory due to their reliance on performance of sentiment analysis and limited capabilities of learning direct relations between target stock trends and their daily social texts.
To bridge this gap, we propose a novel Cross-modal attention based Hybrid Recurrent Neural Network (CH-RNN), which is inspired by the recent proposed DA-RNN model.
Specifically, CH-RNN consists of two essential modules. One adopts DA-RNN to gain stock trend representations for different stocks. The other utilizes recurrent neural network to model daily aggregated social texts. These two modules interact seamlessly by the following two manners: 1) daily representations of target stock trends from the first module are leveraged to select trend-related social texts through a cross-modal attention mechanism, and 2) representations of text sequences and trend series are further integrated.
The comprehensive experiments on the real dataset we build demonstrate the effectiveness of CH-RNN and benefit of considering social texts.
- learning direct relations他做到了吗
- 准确率反而没达到59.15%?
- cross-modal的说法
2018, University of Edinburgh
Stock movement prediction is a challenging problem: the market is highly stochastic, and we make temporally-dependent predictions from chaotic data.
We treat these three complexities and present a novel deep generative model jointly exploiting text and price signals for this task.
Unlike the case with discriminative or topic modeling, our model introduces recurrent, continuous latent variables for a better treatment of stochasticity, and uses neural variational inference to address the intractable posterior inference. We also provide a hybrid objective with temporal auxiliary to flexibly capture predictive dependencies.
We demonstrate the state-of-the-art performance of our proposed model on a new stock movement prediction dataset which we collected.
- recurrent, continuous latent variables是什麽,为什么能处理随机性
- neural variational inference是什么,怎样处理the intractable posterior inference
- hybrid objective with temporal auxiliary?
2015
- stakeholder analysis, Expert systems
- Stakeholder Theory of the Firm
2014
- 有个对过往研究对比的表格,包括数据、方法
- 就这样,精度都有0.5以上,可以作为对比?
2018, 四川金智金工和西南财经
- 有个对过往研究对比的表格,包括所使用的信息来源(新闻、社交媒体、论坛)和Textual Representation方法
2021, 四川金智金工和西南财经
In finance, it is believed that market information, namely, fundamentals and news information, affects stock movements. Such media-aware stock movements essentially comprise a multimodal problem.
Two unique challenges arise in processing these multimodal data. First, information from one data mode will interact with information from other data modes. A common strategy is to concatenate various data modes into one compound vector; however, this strategy ignores the interactions among different modes.
The second challenge is the heterogeneity of the data in terms of sampling time. Specifically, fundamental data consist of continuous values sampled at fixed time intervals, whereas news information emerges randomly. This heterogeneity can cause valuable information to be partially missing or can distort the feature spaces.
In addition, the study of media-aware stock movements in previous work has focused on the one-to-one problem, in which it is assumed that news affects only the performance of the stocks mentioned in the reports. However, news articles also impact related stocks and cause stock co-movements.
In this article, we propose a tensor-based event-driven LSTM model to address these challenges.
Experiments performed on the China securities market demonstrate the superiority of the proposed approach over state-of-the-art algorithms, including AZFinText, eMAQT, and TeSIA.
- 新闻信息和股价数据时间上不一样