论文标题
学习高频融资的功能控制
Learning a functional control for high-frequency finance
论文作者
论文摘要
我们使用深层神经网络来生成控制器,以最佳的高频数据交易。神经网络首次了解交易者的偏好,即风险避免参数和最佳控件之间的映射。学习此映射的一个重要挑战是,在日内交易中,交易者的行动会通过市场影响来影响封闭循环的价格动态。通过调整交易者的偏好以确保在学习阶段产生足够长的轨迹来解决探索 - 探索折衷的折衷。通过转移学习解决了财务数据稀缺的问题:由于蒙特卡罗计划,神经网络首先是对产生的轨迹进行培训,从而在培训历史轨迹之前就可以良好初始化。此外,要回答金融监管机构对机器学习产生的控件的解释性的真实要求,我们将所获得的“ BlackBox控件”投影在通常由风格化最佳交易问题的封闭式解决方案跨越的空间上,从而导致透明结构。对于没有封闭形式解决方案的更现实的损失函数,我们表明生成的控件及其可解释版本之间的平均距离仍然很小。这为财务监管机构接受ML生成的控制措施打开了大门。
We use a deep neural network to generate controllers for optimal trading on high frequency data. For the first time, a neural network learns the mapping between the preferences of the trader, i.e. risk aversion parameters, and the optimal controls. An important challenge in learning this mapping is that in intraday trading, trader's actions influence price dynamics in closed loop via the market impact. The exploration--exploitation tradeoff generated by the efficient execution is addressed by tuning the trader's preferences to ensure long enough trajectories are produced during the learning phase. The issue of scarcity of financial data is solved by transfer learning: the neural network is first trained on trajectories generated thanks to a Monte-Carlo scheme, leading to a good initialization before training on historical trajectories. Moreover, to answer to genuine requests of financial regulators on the explainability of machine learning generated controls, we project the obtained "blackbox controls" on the space usually spanned by the closed-form solution of the stylized optimal trading problem, leading to a transparent structure. For more realistic loss functions that have no closed-form solution, we show that the average distance between the generated controls and their explainable version remains small. This opens the door to the acceptance of ML-generated controls by financial regulators.