With string model to time series forecasting

Ivan Chan

visibility

…

description

14 pages

link

1 file

Abstract

Overwhelming majority of econometric models applied on a long term basis in the financial forex market do not work sufficiently well. The reason is that transaction costs and arbitrage opportunity are not included, as this does not simulate the real financial markets. Analyses are not conducted on the non equidistant date but rather on the aggregate date, which is also not a real financial case. In this paper, we would like to show a new way how to analyze and, moreover, forecast financial market. We utilize the projections of the real exchange rate dynamics onto the string-like topology in the OANDA market. The latter approach allows us to build the stable prediction models in trading in the financial forex market. The real application of the multi-string structures is provided to demonstrate our ideas for the solution of the problem of the robust portfolio selection. The comparison with the trend following strategies was performed, the stability of the algorithm on the transaction costs for long trade periods was confirmed.

Figures (15)

FIG. 1: The scheme of the trading strategy for the prediction model based on the deviations from the closed string/pattern form

FIG. 2: The typical distributions evaluated by the Predictors Evaluator module. (a) The distributions of the incoming (h1)
and outgoing (h2) momenta values M. (b) The distributions of S$“) = +1 (hs) and S$“) = —1 (hs_) as dependence on the
interval (he — hop), both distributions are normalized to the interval (0,1) (for more details see Sec. IV). — FIG. 2: The typical distributions evaluated by the Predictors Evaluator module. (a) The distributions of the incoming (h1) and outgoing (h2) momenta values M. (b) The distributions of S$“) = +1 (hs) and S$“) = —1 (hs_) as dependence on the interval (he — hop), both distributions are normalized to the interval (0,1) (for more details see Sec. IV).

FIG. 3: The net asset value of the simulation (J; = 1000, Q = 1, m = 1) on the EUR/USD currency rate for a selected time
period as the dependence on the ask-bid data spread. — FIG. 3: The net asset value of the simulation (J; = 1000, Q = 1, m = 1) on the EUR/USD currency rate for a selected time period as the dependence on the ask-bid data spread.

FIG. 4: (a) The ask—bid spread in the evaluated OANDA data, (b) the typical distribution of the execution orders per day.
Both histograms are for the time period from 2010/07/15 to 2010/10/15. — FIG. 4: (a) The ask—bid spread in the evaluated OANDA data, (b) the typical distribution of the execution orders per day. Both histograms are for the time period from 2010/07/15 to 2010/10/15.

FIG. 5: The net asset value of the model on the EUR/USD currency rate for a selected time period as the dependence on th
string length parameter and the power constant. — FIG. 5: The net asset value of the model on the EUR/USD currency rate for a selected time period as the dependence on th string length parameter and the power constant.

FIG. 6: The net asset value of the model on the EUR/USD currency rate for a selected time period as the dependence on the
power constant.
() = 1,2,4,8, Fig. 6a represents the dependence on higher values, i. e., Q = 1, 16, 24,32. The comparison of the value
() = 24, which seems to be most suitable for the next forecasts, with the simultaneous use of three values is shown in
Fig. 6b. — FIG. 6: The net asset value of the model on the EUR/USD currency rate for a selected time period as the dependence on the power constant. () = 1,2,4,8, Fig. 6a represents the dependence on higher values, i. e., Q = 1, 16, 24,32. The comparison of the value () = 24, which seems to be most suitable for the next forecasts, with the simultaneous use of three values is shown in Fig. 6b.

TABLE I: The values of string parameters used for the PMBCS Simple and Self-learning models. The square brackets emphasize
the fact that the Simple model works with exactly one value of the analyzed string parameter values, while the Self-learning
model can work with sets of parameters simultaneously. — TABLE I: The values of string parameters used for the PMBCS Simple and Self-learning models. The square brackets emphasize the fact that the Simple model works with exactly one value of the analyzed string parameter values, while the Self-learning model can work with sets of parameters simultaneously.

FIG. 7: The net asset value of the model on the EUR/USD currency rate for a selected time period as the dependence on the
periodic function Fos, (a) the comparison of the forecasts for the functions cos(«), sin(a), sinh(«), cosh(x), (b) the comparison
of the forecasts for the functions sin(z + ¢), where ¢ = 0,7. — FIG. 7: The net asset value of the model on the EUR/USD currency rate for a selected time period as the dependence on the periodic function Fos, (a) the comparison of the forecasts for the functions cos(«), sin(a), sinh(«), cosh(x), (b) the comparison of the forecasts for the functions sin(z + ¢), where ¢ = 0,7.

FIG. 8: The split of the trade command predictions for selected time periods as the dependence on the number of parallel
predictor computations ns, left column (a),(c) with ns = 1, right column (b),(d) with ns =8 . — FIG. 8: The split of the trade command predictions for selected time periods as the dependence on the number of parallel predictor computations ns, left column (a),(c) with ns = 1, right column (b),(d) with ns =8 .

FIG. 9: The histograms of the average values of the execution reports for the Self-learning model with n; = 2 (left column) and
ns = 16 (right column). The time period of the simulation is three months for (a), (b) and one year for (c), (d). — FIG. 9: The histograms of the average values of the execution reports for the Self-learning model with n; = 2 (left column) and ns = 16 (right column). The time period of the simulation is three months for (a), (b) and one year for (c), (d).

FIG. 10: The EUR/USD currency rate ticks from 2010/07/15 10:00:00 (blue dots) with the evaluated values of summarized
prediction (red dots), normalize to +1 (a). The detail of subfigure (a) for a very short time period (b), the evaluated string
momenta values (red) with the currency rate ticks on the background (blue). — FIG. 10: The EUR/USD currency rate ticks from 2010/07/15 10:00:00 (blue dots) with the evaluated values of summarized prediction (red dots), normalize to +1 (a). The detail of subfigure (a) for a very short time period (b), the evaluated string momenta values (red) with the currency rate ticks on the background (blue).

TABLE II: The comparison of the results for the net asset values (NAV) of our model and basic time series forecasting models
(ARIMA) and trading strategies (SCALPER, MACD) on the EUR/USD currency rate for the time period 2010/07/15 —
2010/10/15. Mean p is the average of the values (reference point 10°), o is the standard deviation and NAV is the percentage
change of the start and end positions for the selected time period (see also Fig. 11). — TABLE II: The comparison of the results for the net asset values (NAV) of our model and basic time series forecasting models (ARIMA) and trading strategies (SCALPER, MACD) on the EUR/USD currency rate for the time period 2010/07/15 — 2010/10/15. Mean p is the average of the values (reference point 10°), o is the standard deviation and NAV is the percentage change of the start and end positions for the selected time period (see also Fig. 11).

FIG. 11: The net asset value of the model on the EUR/USD currency rate for a selected time period compared with the values
of basic time series forecasting models (ARIMA) and trading strategies (SCALPER, MACD) (see also Tab. IJ). — FIG. 11: The net asset value of the model on the EUR/USD currency rate for a selected time period compared with the values of basic time series forecasting models (ARIMA) and trading strategies (SCALPER, MACD) (see also Tab. IJ).

FIG. 12: The net asset value of the model on the EUR/USD currency rate for a selected time period as the dependence on the
sets of trade parameters ns and as the dependence on the statistical methods (return volatility). — FIG. 12: The net asset value of the model on the EUR/USD currency rate for a selected time period as the dependence on the sets of trade parameters ns and as the dependence on the statistical methods (return volatility).

FIG. 13: The examples of the currency data map for EUR/USD OANDA data represented by the 2-end-point string map
PO (r, h) (a) and the conjugate variable Kr, h) (b). The calculation carried out for 1; = 1000, g = 1 at some time instant. — FIG. 13: The examples of the currency data map for EUR/USD OANDA data represented by the 2-end-point string map PO (r, h) (a) and the conjugate variable Kr, h) (b). The calculation carried out for 1; = 1000, g = 1 at some time instant.

With string model to time series forecasting Richard Pinčák1, ∗ and Erik Bartoš2, † arXiv:1511.00483v1 [q-fin.ST] 2 Nov 2015 1 Institute of Experimental Physics, Slovak Academy of Sciences, Watsonova 47, 043 53 Košice, Slovak Republic 2 Institute of Physics, Slovak Academy of Sciences, Dúbravská cesta 9, 845 11 Bratislava, Slovak Republic (Dated: November 23, 2015) Overwhelming majority of econometric models applied on a long term basis in the financial forex market do not work sufficiently well. The reason is that transaction costs and arbitrage opportunity are not included, as this does not simulate the real financial markets. Analyses are not conducted on the non equidistant date but rather on the aggregate date, which is also not a real financial case. In this paper, we would like to show a new way how to analyze and, moreover, forecast financial market. We utilize the projections of the real exchange rate dynamics onto the string-like topology in the OANDA market. The latter approach allows us to build the stable prediction models in trading in the financial forex market. The real application of the multi-string structures is provided to demonstrate our ideas for the solution of the problem of the robust portfolio selection. The comparison with the trend following strategies was performed, the stability of the algorithm on the transaction costs for long trade periods was confirmed. PACS numbers: 11.25.Wx, 89.65.Gh, 89.90.+n Keywords: string theory, time-series forecast, econophysics, trading strategy, oanda market I. INTRODUCTION It has often been argued and there is corroborating empirical evidence which suggests that stock market prices exhibit wave like properties [1]. In classical financial mathematics there were conducted fundamental investigations [2, 3] to find adequate stochastic processes matching the real financial data: Brownian [4], geometric Brownian [5], general Lévy processes [6]. From the point of view of quantum-like approach [7], the problem cannot even be formulated in such a way, due to the lack of the description of the whole financial market as one single space [8]. The financial markets have their own intrinsic geometrical structure [9, 10], but there is no classical stochastic process which would match with the real financial data. Despite some early studies to characterize the nature of topological space of time series data by identifying the state of a financial market [11, 12] as a complex non-stationary system, there exists no definition of the space of time series data which satisfies the definition of topological space with the fixed-point property, i. e., it fulfills the T0 – Kolmogorov separation axiom [13]. In other words one financial time series are composed of infinitely many random variables in which the average of all of this random variables not always converge to single variable (single Kolmogorov space). Moreover, the fluctuations in the real market are not Gaussian [14] unless they decay as fast as Gaussian, and the history shows that the catastrophic loss (gain) is more likely than the Gaussian model. The instantaneous return on the Financial Times-Stock Exchange (FTSE) and all Share Index is viewed as a frictionless particle moving in one-dimensional square well (potential barrier) but where there is a non-trivial probability of the particle tunneling into the well’s retaining walls with some prices uncertainty [15–18]. The potential barrier approach is frequently used for barrier option calculations in the stock market [19]. Combinatorial auctions on financial game theory seem to be the most promising field. The promising quantum-like experiments give rise to commercial implementation of quantum auctions in the near future [20]. By including a sufficient amount of hidden variables, stochastic models may be able to reproduce the historic observations with similar accuracy. Indeed, the (possible) existence of hidden variables was at the heart of the early critique of quantum theory. It is almost self-evident that the return on a stock depends on many factors that have not been modeled. The question nevertheless is not just one of having a more efficient description of the dynamics. Unlike hidden variables describing physical phenomena, the factors that influence the dynamics of a stock are expected to change over time. Moreover, it is not clear how economic factors such as gross national product influence the value of any given stock at any given point in time. The observed scaling of the return distributions for various stocks in different economic environments ∗ Electronic † Electronic address: pincak@saske.sk address: erik.bartos@savba.sk 2 strongly suggests that all these “hidden” factors find their expression in the mean return and the variance of the returns. In the economics a problem of the asymmetric information of inefficiency of efficiency market hypothesis still exists. This problem was solved by some micro economists by using the duality theory of Walrasian utility function [21] but it is still present in the macro economics time series as so called memory process in the time series of stock price. The dynamics that determines the shape of the return distributions, on the other hand, must be self-consistent and largely immune to the influence of “hidden” variables that are specific for a company and economic and political climate. Hidden variables in our model is an analogy with hidden dimensions in string theory, both are non-visible and non-measurable, however, they have great influence on the whole system. Especially for the learning of buying and selling signals for the currency pairs as well in financial trading, the position is a binding commitment to buy or sell a given amount of financial instruments. Open positions remain subject to fluctuations in the exchange rate. Open positions are closed by entering into a trade that takes the opposite position to the original trade. The net effect is to bring the total amount for currency pair back to zero. The bid price (pbid (τ )) is always less than the ask price (pask (τ )) because brokers pay less than they receive for the same currency pair. The spread represents your cost to trade with a broker. The currency pair p(τ ) indicates how much of the quote currency is required to purchase one unit of the base currency, particular currency, which comprises the physical aspects of a nations money supply. In our string theory approach the hidden variables are ls – the length of momentum string, Q – the quotient or the exponent of the momentum, m – the frequency of the momentum function and will be described in the next sections. Almost all known econometric models applied on a long term basis in the financial forex market do not work sufficiently well, as was shown in [22]. The main goal of this paper is the practical demonstration of the usability of the previously derived model in the real market conditions, e. g., online trading with real data. For this purpose we have used the developing version of the online trade system [23] with tick-data level accuracy of simulations. The OANDA database[24] has served as a source of the data, apart from the previous forex market data. As the complex multi-string structures produced by the generalized derivatives of strings cannot be easily grasped by the intuitive principles we have tried to provide the real application of our string approach with a lot of illustrative examples. The proposed string model algorithm behaviour and the stability on the transaction costs was compared with the well known prediction models and trading strategies [25–28] which serve as the benchmark tests of the more complex time series forecasting models. In Section II, the algorithm of the trade system, the prediction model and the main motivation are presented. In Section III, the analyses of the model with a more detailed description and the results are commented. The prediction of long-run profit by means of a spin of strings is sketched in Section IV. In the last Section V, the conclusions are summarized. II. METHOD The results of our paper are based on the previous work, where the prediction model based on the deviations from the closed string/pattern form (PMBCS) was derived. The model was tested on the forex market historical data with some interesting results which have confirmed our string theory approach. Here we present only the basic facts important for further understanding, the details and the general overview of the problems are given in [22, 29, 30]. All analyses were done in order to maintain a positive value of the net asset value (NAV) for the longest period of time. For all simulations the ask-bid data spreads are included into consideration. It means that all presented results for NAV include also the transaction costs induced by data spreads [31, 32], if not mentioned otherwise. Other types of transaction costs, e. g., brokers’ commissions are not taken into account. The build-in algorithm of the NAV evaluation is as similar as possible to the OANDA algorithm, it was tested on the real data and the deviations were about 0.5 % only. A. PMBCS prediction model In the PMBCS model the incoming currency rates data are transformed into the one dimensional objects “strings”; mathematically, they are the scalar functions of several variables, which are represented by the momentum of the string. We have defined the momentum of the string as !1/Q ls Q 1 X (1) pstand (τ, h, ls ) − FCS (h, ls ) M(ls ,m,Q,FCS ) = ls + 1 h=0 3 where p(τ + h) − pmin (τ, ls ) , pmax (τ, ls ) − pmin (τ, ls ) pmax (τ, ls ) = max p(τ + h) , pstand (τ, h, ls ) = h∈{0,1,2,...,ls } pstand ∈ (0, 1), pmin (τ, ls ) = (2) min h∈{0,1,2,...,ls } p(τ + h), h denotes a tick lag between currency quotes p(τ ) and p(τ + h), τ is the index of the quote. FCS (h, ls ) is the regular function FCS (h, ls ) = 1 1 + cos(ϕ̃) , 2 ϕ̃ = 2πmh + ϕ, ls + 1 (3) ϕ is a phase of a periodic function cos(ϕ̃). The periodic function cos(ϕ̃) in the definition of the regular function, Eq. (3), could be substituted by different types of mathematical functions. The momentum defined above takes the values from the interval M(ls ,m,Q,ϕ) ∈ (0, 1). In Sec. III, the behaviour of the momentum with some explicit examples of the regular function FCS is discussed. B. Trading algorithm The main purpose of our work was to demonstrate the usability of the PMBCS model under the different circumstances, as in the previous cases. The results were expected to be performed on the OANDA data with the most realistic trade conditions. For this purpose, the developing version of the trade online system was used [23]. It provides highly professional algorithmization of the trading strategies with tick-data level accuracy of simulations. For the PMBCS prediction model the trade system has worked with defined trading strategy, its trading algorithm is schematically reviewed in Fig. 1. Various sources serve as input data for the algorithm, e. g., real time data or historical data of currency rates, we have chosen the second one. Data are handled by the current prediction model (PMBCS Prediction Model). In the heart of the algorithm there lies a momentum calculator module (Moment Predictors). The module calculates the values of the momentum predictors, which determine the values for the string parameters of the algorithm (Optimal String Parameters). These values directly affect the trade positions (Open/Close Trade positions). This is the most direct way. But the algorithm works also in the parallel way. The values of the momentum predictors are statistically evaluated with the results of the open/close trade positions and compared with predicted values (Predictors Evaluator). Then the “optimal” values of the momenta are sent ahead to provide the “optimal” values of the string parameters (Fig. 2a). Beside the first set of parameters, there exists also the second set of parameters, which controls the risk of the algorithm. They are called the trade strategy parameters and in our case they are represented by a maximum number of simultaneously opened trades, skewness of momenta distribution and Sharpe ratio of closed trades. Together, the trade strategy parameters and “optimal” string parameters determine the final opening and closing of trade positions. The left side of the scheme outlines future Strategy Evaluator. Its purpose is to evaluate the trade strategy parameters and this way to control the risk in a more sophisticated way. However, in this paper, the values of the trade strategy parameters were fixed to reasonable constants throughout the simulations, e. g., the maximum number of opened trades was set to 10/hour. The main reason is that the online process of finding the best strategy parameters is time consuming and needs intense computing power, which was not able at the time of article preparation. On the other hand, as the trade strategy parameters are the matter of real online trade system, they are not so important at the level of algorithm development. III. PMBCS MODEL RESULTS As was mentioned in the previous section, the online trade system operates with two kinds of parameters, there are trade strategy parameters and optimal string parameters. We have not pretended to determine all parameters, as the process of the online trading is robust and it requires a verified and confident system. Our purpose was to demonstrate how the string parameters influence the results and to find out the method how they can be optimized in a simulation process. However, the trade strategy parameters were chosen as close as possible to the real ones. 4 FIG. 1: The scheme of the trading strategy for the prediction model based on the deviations from the closed string/pattern form. EUR/USD EUR/USD 3 ×10 3 3 × 10 × 10 200 120 h1 180 Entries 3167484 Mean 0.3592 100 160 hS+ 350 Entries 3154935 Mean 0.1303 300 140 80 h2 60 Entries 1799518 Mean 0.3655 250 hS- 120 Entries 3154935 Mean 0.8425 100 200 150 80 40 60 100 40 20 50 20 0 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 M value 0 0 0.1 (a) 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0 0.9 1 Unit interval (b) FIG. 2: The typical distributions evaluated by the Predictors Evaluator module. (a) The distributions of the incoming (h1 ) and outgoing (h2 ) momenta values M . (b) The distributions of S (n) = +1 (hS+ ) and S (n) = −1 (hS− ) as dependence on the interval (hcl − hop ), both distributions are normalized to the interval (0, 1) (for more details see Sec. IV). A. Empirical analyses As the starting point of our prediction simulations we considered the string parameters which describe the momentum function M(ls ,m,Q,FCS ) (see Eq (1)): ls – the length of momentum string (the number of ticks or time period), Q – the 5 EUR/USD 3 106 ×10 EUR/USD 3 102 105 104 101.5 without spread Ask-Bid data spread without spread with spread 101 103 with spread 100.5 Net asset value Net asset value ×10 Ask-Bid data spread 102 101 100 99 100 99.5 99 98 98.5 97 98 2010/09/05 96 2010/07/15 2010/08/07 2010/08/30 2010/09/22 2010/09/07 2010/09/08 2010/09/09 2010/09/10 Trading dates 2010/10/15 Trading dates FIG. 3: The net asset value of the simulation (ls = 1000, Q = 1, m = 1) on the EUR/USD currency rate for a selected time period as the dependence on the ask-bid data spread. EUR/USD EUR/USD 14 6 Entries 387 Entries 3168484 10 12 5 Execution orders per day 10 104 3 10 10 8 6 4 2 10 2 10 0 0.0005 0.001 0.0015 0.002 Data spread (time period 2010/07/15 - 2010/10/15) (a) 0 2010/06/30 2010/10/30 Trading dates (b) FIG. 4: (a) The ask–bid spread in the evaluated OANDA data, (b) the typical distribution of the execution orders per day. Both histograms are for the time period from 2010/07/15 to 2010/10/15. quotient or the exponent of the momentum (a deformation of string), m – the frequency of the momentum function and last but not least, the momentum function depends also on the FCS (h, ls ) – the regular function. We illustrated the impact of these parameters and the prediction behavior of our model on the net asset value. The simulations were carried out on the OANDA data for EUR/USD currency rate for the time period of three months, from 2010/07/15 to 2010/10/15. The typical background of the simulations is presented in Figs. 2–4. Figure 3 shows the dependence in the three month simulation on the ask-bid data spread, i. e., on the transaction costs (Fig. 4a). Also for long trade periods our algorithm behaves very stably, the values of NAV do not show the high dependence of the transaction costs. The distribution of the trades is nearly uniform throughout all time period, it varied in the range from 0 to 14 per day (Fig. 4b). There are no rapid increases and decreases of the amounts of the trades. The string statistics in the Predictors Evaluator module is represented by Fig. (2a) by the typical distribution of evaluated momenta M . The distributed momenta incoming and outgoing from the module are represented by the histograms h1 and h2 . The figure shows that the selected momenta M with the expected values (the mean) from the interval (0.3, 0.4) are actually crucial in the predictions. Their treatment in the first steps of the evaluation can define the next course of the simulation and to shorten the computation time. In Figs. 5–7 we described some interesting results relating to the string parameters. The net asset value of the model dependence on the string length parameter ls is presented in Fig. 5a. As one can see, the value of ls = 900 seems to be most promising, this value was fixed for the next predictions. The dependence of the model on the parameter Q – the quotient of the moment is described on the next figures. Figure 5b represents the dependence on low values, i. e., 6 EUR/USD 106 ×10 EUR/USD 3 107 ×10 3 Power constant 105 106 104 105 103 104 Q=1 Q=2 Q=4 Net asset value Net asset value Q=8 102 101 100 String length 99 103 102 101 100 800 900 98 99 1000 1100 97 98 96 97 2010/07/15 2010/08/07 2010/08/30 2010/09/22 2010/10/15 2010/07/15 2010/08/07 2010/08/30 2010/09/22 Trading dates 2010/10/15 Trading dates (a) (b) FIG. 5: The net asset value of the model on the EUR/USD currency rate for a selected time period as the dependence on the string length parameter and the power constant. Q = 1, 2, 4, 8, Fig. 6a represents the dependence on higher values, i. e., Q = 1, 16, 24, 32. The comparison of the value Q = 24, which seems to be most suitable for the next forecasts, with the simultaneous use of three values is shown in Fig. 6b. EUR/USD 107 ×10 EUR/USD 3 105 ×10 3 Power constant 106 104 Q= 1 Q = 16 105 103 Q = 24 Q = 32 102 Net asset value Net asset value 104 103 102 101 101 100 99 100 98 99 97 Power constant Q = 24 Q = 16, 24, 32 98 96 97 2010/07/15 95 2010/08/07 2010/08/30 2010/09/22 2010/10/15 2010/07/15 2010/08/07 2010/08/30 Trading dates (a) 2010/09/22 2010/10/15 Trading dates (b) FIG. 6: The net asset value of the model on the EUR/USD currency rate for a selected time period as the dependence on the power constant. The interesting case is the choice of the regular function FCS . The previous forecasts [22, 29] were made for a trigonometric function cos(x). In this paper, we presented the results of the tests with other functions, as one can see in Fig. 7. The comparisons of the forecast with the function cos(x) were made for forecasts with functions sin(x), sinh(x) and cosh(x) in Fig. 7a. The subfigure Fig. 7b represents the comparison of forecasts for the function sin(x) with the different arguments, i. e., x and x + φ, where φ = 0, π. B. Self-learning model The presented forecasts only partially show the possibilities of our algorithm. The values of string parameters, which were used in previous forecasts, are summarized in Table I in a simple model column. For each forecast we 7 EUR/USD 106 ×10 EUR/USD 3 106 ×10 3 Periodic function 105 105 cos(x) sin(x) 104 104 sinh(x) cosh(x) 103 Net asset value Net asset value 103 102 101 100 102 101 100 99 99 98 98 Periodic function sin(x) sin(x+φ) 97 97 96 2010/07/15 96 2010/08/07 2010/08/30 2010/09/22 2010/10/15 2010/07/15 2010/08/07 2010/08/30 2010/09/22 Trading dates 2010/10/15 Trading dates (a) (b) FIG. 7: The net asset value of the model on the EUR/USD currency rate for a selected time period as the dependence on the periodic function FCS , (a) the comparison of the forecasts for the functions cos(x), sin(x), sinh(x), cosh(x), (b) the comparison of the forecasts for the functions sin(x + φ), where φ = 0, π. String parameters PMBCS Simple model PMBCS Self-learning model ls Q FCS m φ 800, 900, 1000, 1100 1, 2, 4, 8, 16, 24, 32 cos(x), sin(x), sin(x + φ), sinh(x), cosh(x) 0, 1, 2, 3 0, 3.14 [900] [8, 16, 24, 32] [cos(x + φ)] [0, 1, 2, 3] [0, 3.14] TABLE I: The values of string parameters used for the PMBCS Simple and Self-learning models. The square brackets emphasize the fact that the Simple model works with exactly one value of the analyzed string parameter values, while the Self-learning model can work with sets of parameters simultaneously. used one value for each type of parameters: ls , Q, FCS , m and φ, respectively. In other words, only one set of string parameters is used, we have denoted it by ns = 1. However, the algorithm can work with various sets of free parameters simultaneously (ns ≥ 1). This is possible due to the fact that the model was enhanced to the so called Self-learning model. In terms of the string parameters from Table I, all possible combinations of the values from the third column are taken into account and the corresponding momentum predictors are calculated. As was said before in Sec. II B, the values of momentum predictors under consideration are statistically evaluated (Predictor Evaluator module). For this purpose, the statistical quantity called the Sharpe ratio was introduced [30], it is defined as S= E(R − Rf ) , σ (4) where R is the asset return, Rf is the return on a benchmark asset (risk free), E(R − Rf ) is the mean value of the excess of the asset return over the benchmark return, σ is the standard deviation of the excess of the asset return. The definitions of formulas are N [i] T [i] X 1 X [i] E(R − Rf ) = pj , (R − Rf [i] ), R[i] = N i=1 j=1 v u N u1 X 2 (R[i] − Rf [i] ) , σ =t N i=1 Rf [i] = T X (pj − j ∗ P ), (5) j=1 (6) where N is the number of the total closed positions, T [i] is the number of the accepted trade results (closed positions), P is the unit penalty parameter. The closing of the trade position is influence by the trade altitude parameter. Its value as well the value of the unit penalty parameter P were set empirically. 8 (a) (b) (c) (d) FIG. 8: The split of the trade command predictions for selected time periods as the dependence on the number of parallel predictor computations ns , left column (a),(c) with ns = 1, right column (b),(d) with ns = 8 . The Sharpe ratio helps to find the best combinations of string parameters. In a such way, the model is the Selflearning model, the trading works with the optimal parameters up to the moment when the new optimal combination of parameters is found. For this purpose, the Sharpe ratio needs a sufficient amount of predicted momenta to provide reliable optimization. The number of sets of string parameters ns must be fixed at a sufficient number, as one can see from Fig. 8 where the split of the trade command predictions is demonstrated for ns = 1 and ns = 8. The positive values of trade command prediction are in favour of the trade, the negative values are against the trade. The lower value of ns means less statistics available for the finding of optimal parameters. On the other hand, very high values of ns need the corresponding computing power. In Fig. 9, we compared the effect of the higher value of ns for the Self-learning model with ns = 2 (left column) and ns = 16 (right column). The histograms show the average values of the execution reports sent by the model. In the subfigures 9a–9b (three months data), the effect is not seen, however, in the subfigures 9c–9d (one year data) the number of the execution reports is approximately 100 times higher. It means that the algorithm is more flexible in trading for a longer period. The opening and closing of trade positions is determined by statistical evaluation of string momenta. In Fig. 10, such a statistical procedure is demonstrated graphically for ns = 16. Each new price tick leads to the evaluation of string momenta, in our case to sixteen values equal to −1, 0, 1. One can see the distribution of evaluated values in Fig. 10b for a very short time period. A few positive values and one negative are clearly visible, the others are equal to zero. Then in Fig. 10b the red dot represents the summarized value of the evaluated string momenta normalized to ±1. The blue dots are the EUR/USD currency rate ticks from 2010/07/15 10:00:00 up to the first 5 × 105 ones. On the left of the subfigure one can see the instant of the “learning”, when the statistics is gained and the momenta do not predict any values. Later the algorithm is fully operational. The string PMBCS Self-learning model was benchmarked against the basic time series forecasting models and trading strategies. For this purpose we chosen the scalping strategy of taking profits on small price changes (SCALPER) [25], a trend-following momentum indicator Moving Average Convergence Divergence (MACD) based on the exponential moving averages (EMA) [26] and finally the class of the autoregressive integrated moving average models (ARIMA) [27, 28], including ARIMA(0,0,0)+c – the mean constant model, ARIMA(0,1,0) – the random walk model and ARIMA(0,1,0)+c – the random walk with drift model (for different constants c). All mentioned models were implemented into the trade system as the corresponding algorithms. The results of the simulations are presented in Tab. II and Fig. 11. As one can see the predicted NAV values after three months time period are close to zero profit for the most of the cases, especially for ARIMA models. Also the mean µ of NAV values oscillates around the starting point throughout the whole period. It is not surprising, one expects such behaviour for the random walk models where predicted values are equal to the last observed values. The reliability of the SCALPER method for examined period is also very small as it tends to the negative profit very rapidly. However, the scalping strategy is primary intended to take small profits for short time scales. 9 EUR/USD EUR/USD 40 25 Entries Mean RMS 35 371 1.317 0.04262 Entries Mean RMS 389 1.321 0.04372 20 30 25 15 20 10 15 10 5 5 0 1.2 1.25 1.3 1.35 1.4 0 1.2 1.45 1.5 Average price 1.25 1.3 1.35 (a) 1.4 1.45 1.5 Average price (b) EUR/USD EUR/USD 160 10000 Entries Mean RMS 140 3595 1.361 0.06008 Entries Mean RMS 9000 309630 1.326 0.0597 8000 120 7000 100 6000 80 5000 4000 60 3000 40 2000 20 0 1.2 1000 1.25 1.3 1.35 1.4 0 1.2 1.45 1.5 Average price 1.25 1.3 1.35 (c) 1.4 1.45 1.5 Average price (d) Exchange rate / Predictor Value FIG. 9: The histograms of the average values of the execution reports for the Self-learning model with ns = 2 (left column) and ns = 16 (right column). The time period of the simulation is three months for (a), (b) and one year for (c), (d). 1.5 1 0.5 0 -0.5 880 875 -1 16 870 14 12 10 Predic 860 8 tor No (a) 6 . 4 855 2 865 tick ce Pri 0 850 (b) FIG. 10: The EUR/USD currency rate ticks from 2010/07/15 10:00:00 (blue dots) with the evaluated values of summarized prediction (red dots), normalize to ±1 (a). The detail of subfigure (a) for a very short time period (b), the evaluated string momenta values (red) with the currency rate ticks on the background (blue). 10 (a) (b) FIG. 11: The net asset value of the model on the EUR/USD currency rate for a selected time period compared with the values of basic time series forecasting models (ARIMA) and trading strategies (SCALPER, MACD) (see also Tab. II). EUR/USD ×10 EUR/USD 3 106 105 105 104 104 103 103 Net asset value Net asset value 106 102 101 100 99 ns=2 ns=16 97 96 2010/07/15 3 102 101 100 99 Sets of parameters 98 ×10 Sharpe ratio 98 standard deviation σ 97 return volatilitiy σ r 96 2010/08/07 2010/08/30 2010/09/22 2010/10/15 2010/07/15 2010/08/07 2010/08/30 2010/09/22 Trading dates 2010/10/15 Trading dates (a) (b) FIG. 12: The net asset value of the model on the EUR/USD currency rate for a selected time period as the dependence on the sets of trade parameters ns and as the dependence on the statistical methods (return volatility). Model String PMBCS SCALPER MACD Mean µ Sigma σ NAV [%] 741 −1396 −383 1482 1340 1020 4.33 −3.99 1.35 Model ARIMA(0,0,0) + c1 ARIMA(0,1,0) ARIMA(0,1,0) + c2 ARIMA(0,1,0) + c3 Mean µ Sigma σ NAV [%] 95 −247 −1699 44 249 805 1208 679 0.40 −0.67 −3.04 0.89 TABLE II: The comparison of the results for the net asset values (NAV) of our model and basic time series forecasting models (ARIMA) and trading strategies (SCALPER, MACD) on the EUR/USD currency rate for the time period 2010/07/15 – 2010/10/15. Mean µ is the average of the values (reference point 105 ), σ is the standard deviation and NAV is the percentage change of the start and end positions for the selected time period (see also Fig. 11). 11 (a) (b) FIG. 13: The examples of the currency data map for EUR/USD OANDA data represented by the 2-end-point string map (2) (2) P1 (τ, h) (a) and the conjugate variable X1 (τ, h) (b). The calculation carried out for ls = 1000, q = 1 at some time instant. C. Volatility The risk of the algorithm is controlled by the trade strategy parameters, see Sec. II B. Their precise determination is the matter of the empirical analysis and the setting of the degree of risk. However, we tried to find out how the Sharpe ratio (Eq. 4) can influence the trading strategy. We focused on the volatility which refers to the standard deviation of currency returns of a financial instrument within a specific time horizon described by one half of the string length ls /2 (see Sec. 6 in [30]). The return volatility at the time scale ls /2 is defined by σr (ls /2) = q r2 (ls /2) − r12 (ls /2), rm (ls ) = ls /2 X h=1 p(τ + h) − p(τ + h − 1) p(τ + h) m , m = 1, 2. (7) The brief result for the prediction dependence on the Sharpe ratio is presented in Fig. 12. The set of string parameters was fixed at value ns = 16, the time period was from 2010/07/15 to 2010/10/15. The subfigure 12a describes the dependence on the standard sigma σ as was always before, the subfigure 12b describes the influence of σr . The enhancement of the results is rather small, but it shows us that the investigation of Sharpe ratio dependencies must be taken into account seriously. The next improvement of the Self-learning model (by sophisticated evolutionary algorithms, statistical evaluation) could help to find optimal string parameters. On the other hand, the recent results, which did not choose the volatility of the market as a model indicator, e. g., the methodology of scale of market quakes [33], are quite promising and maybe the future version of the algorithm could show interesting progress in this field. IV. SPIN AS A PROFIT FOR LONG POSITION As another application of the string theory approach we would like to sketched the model based on the 2-end-point (2) (2) string maps Pq (τ, h) and their conjugate variables Xq (τ, h) (see Sec. 3 in [30]). The choice for the variable X is cleared from Fig. 13. In comparison to the 2-end-point string map P its behaviour is smoother and hence it is less noisy. It highlights only the highest events in the market, so it seems to be the better object for the treatment of the statistics in our predictions. Discrete dynamical rules are implemented where the string state is sequentially transferred to the past and stored by means of instant replicas. In this model, the n-th string of replica system is described by the tuple (for the simplification the indices “q” and “(2)” are omitted) h i (n) {XS (τ, h)}; n = 0, 1, . . . , N ; h = 0, . . . , hop ; S (n) ∈ {−1, 1} (8) including string coordinates and additional one spin supplementary variable S (n) . The meaning of the spin is the same as in particle physics where there are two possibilities for the spin orientation of particle [34]. Suppose the 12 long position is opened at the quote hop and closed at hcl > hop , then S (n) describes profit when S (n) = +1 or loss when S (n) = −1. In the case of buy order the sign of S (n) can be deduced from the price change according to S (n) = sgn(Xb (hcl ) − Xa (hop )). The differences between string states can be measured by the string-string Hilbert Lp -distance as follows 1/p hop dX X (N ) X p 1 (n) = Xj (τ, h) − Xj (τ, h)  , dX hop j=0  Dp(n,N ) (9) h=0 where n, N are the string-replica indices. The fuzzy character of the prediction of the spin variable of the N -th replica is described by means of S (N ) = N red X S (n) w(n,N ) , Nred = N − (hcl − hop ), (10) n=0 which includes Boltzmann-like weights w(n,N ) (N ) (n,N ) exp −cD Dp /Dp , =P (N ) (n′ ,N ) Nred exp −c D /D ′ p D p n =0 (11) where the inter-replica distance is rescaled by the mean (N ) Dp = 1 Nred + 1 N red X Dp(n,N ) . (12) n=0 The example of the distributions S (n) = +1 and S (n) = −1 as dependence on the interval (hcl − hop ) is represented by the histograms hS+ and hS− in Fig. 2b (both distributions are normalized to the interval (0, 1)). In the case of M distribution (Fig. 2a) the relevant values of the mean lay in the close interval near (0.3, 0.4), i. e., too low and high values of M momenta were not important for the determination of Sharpe ratio in the Predictors Evaluator module. It seams that for the spin statistics is the situation opposite, the low values in the S (n) = +1 distribution give the most predictive result. We hope that for the future analyses the spin statistics can bring more valuable results. A. Symbolic dynamics and inter-string information transfer (N ) We postulated dynamics as ordered moves of the data. The moves originate from the initial string Xj transformation of data. The information then passes sequentially along copies index n according to (N ) Xj (N ) (h) ← Xj (h) , ... ... (1) (2) Xj (h) ← Xj (h), (0) (1) Xj (h) ← Xj (h) , (n=0) We see that the information becomes lost at Xj selection of final trades. V. (h) including in the sense of decremented replica S (n) ← sgn(Xb (hcl ) − Xa (hop )), (h) ← Xj (h) , (N −1) Xj (n) Xj S (N −1) ← S (N ) , S (1) ← S (2) , S (0) ← S (1) . (13) . This method could be useful for trading algorithm especially for CONCLUSION It is known by referring to the experimental data published in many journals that the random walk model is a good approximation of the market reality in static situations or in an equilibrium state of the financial market. Whenever 13 such extra-ordinary events take place, especially leading to a financial or economical crisis, the random walk model fails. It means that in case of big instability with big volatility and fluctuation of prices, some other approaches need to be developed to describe and obtain real dynamics of such a market panic behaviour. This work is some attempt to explain and moreover predict financial data with a new string model approach. Market equilibrium comes at the price of commodity for balancing the market forces like demand & supply. In market equilibrium, the amount that the buyers want to buy is equal to the amount that the sellers want to sell. We call this equilibrium when the forces of demand & supply are in balance, whereas there is no reason for a price to rise or fall as long as other factors remain unchanged. It is the situation in which the supply of an item is exactly equal to its demand. Since there is neither surplus nor shortage in the market, price tends to remain stable in this situation. Equilibrium price is also called market clearing price because at this price, the exact quantity that producers take to the market will be bought by consumers, and there will be nothing ‘left over’. For markets to work, an effective flow of information between a buyer and a seller is essential. The flow of information is described in our approach as an interaction part of the energy operator in kinetic and potential energy of the simulated strings. This paper does not cover the full range of possible properties of the string theory approach. In addition to the theoretical modeling, the purpose of the paper was the study of the financial forecasting on the OANDA real data for the PMBCS model as well as the demonstration of some interesting properties of our model on real online trade system. As a result, the abilities of the self-learning model were shown to find the optimal string parameters for the final opening/closing of trade positions. The model was benchmarked against the most used class of the time series forecasting models and trading strategies, the simulations on OANDA data shows the higher values of the profit in comparison with the trend following strategies, e. g. MACD, ARIMA. The stability of the algorithm on the transaction costs for long trade periods was demonstrated. The next logical step is the improvement of the Self-learning algorithm for the data evaluation, as the received results are quite encouraging. For another application of the string approach, we sketched some hierarchical model of algorithmic chemistry from string atoms to string molecules as a method of adaptive boosting. Discrete dynamical rules are implemented where the string state is sequentially transferred to the past and stored by means of instant replicas, as was developed in Sec. IV. We defined a spin of strings which could detect a long-run profit where a fuzzy character of the prediction of the spin variable of the N-th replica can be investigated. Finally, inter-strings information transfer can be analyzed as an analogy with dynamic of prices or currency at specified exchange rate options. And last but not least, the proper algebraic and geometric construction of the space of time series, possible under a Kolmogorov space concept with consistent separation axiom, could help with the analyses of nonstationary time series models, the volatility clustering phenomena in financial time series data or the separation of hidden Markov transition probability state in quantum entaglement state. There are some indications that such a concept could be realized as a topological space with a few hidden states in extradimensions of loop space of time series data. It will be interesting to join hidden states with the empiricaly observed characteristic correlation structures patterns (e. g. [12]), but the other investigations must take place to the future. Acknowledgments The work was supported by the Science and Technology Assistance Agency under Contracts No. APVV-0171-10, No. APVV-0463-12, VEGA Grant No. 2/0037/13 and Ministry of Education Agency for Structural Funds of EU in the frame of the projects 26220120021, 26220120033 and 26110230061. R. Pincak would like to thank the TH division in CERN for hospitality. [1] [2] [3] [4] [5] [6] [7] [8] [9] [10] [11] [12] A. Ataullah, M Tippett, Physica A 382 (2007) 557. E. F. Fama, Journal of Finance, Vol. 25 (1970) 383–417. B. B. Mandelbrot, Econometrica, Vol. 38 (1970). B. De Meyer, H. Moussa Saley, Int. J. of Game Theory, Vol. 31, I. 2 (2003) 285–319. B. B Mandelbrot, J. W. van Ness, SIAM Review 10 (4) (1968) 422–437. W. Schoutens, “Lévy Processes in Finance: Pricing Financial Derivatives”, Wiley (2003) 200p. S. Farinelli, http://arxiv.org/abs/0910.1671 A. Khrennikov, Chapter 5, pp. 67–77 in M. Faggini and T. Lux (Eds.), Springer 2009, pp. 177. D. Carfi, http://arxiv.org/abs/1106.1774 K. IIinski, J Phys. A: Math. Gen. 33 (2000) L5-L14. J. D. Hamilton, Econometrica 2 (1989) pp. 357–384. M. C. Münnix et al., Sci. Rep. 2, 644 (2012) 1–6. 14 [13] [14] [15] [16] [17] [18] [19] [20] [21] [22] [23] [24] [25] [26] [27] [28] [29] [30] [31] [32] [33] [34] Z. Karno, Formalized Mathematics, Vol. 5(1) (1996) 119–124. L. Borland, Phys. Rev. Lett. 89 (2002) 0987011. A. Ataullah, I. Davidson, M. Tippett, Physica A 388 (2009) 455. F. Bagarello, Physica A 386 (2007) 283. C. Zhang, L. Huang, Physica A 389 (2010) 5769. P. Pedram, Physica A 391 (2012) 2100. T. K. Jana, P. Roy, Physics A 390 (2011) 2350. E. W Piotrowski, J. Sladkowski, Physica A 387 (2008) 3949. S. G. Hall, Oxf. Econ. Pap. 35 (2) (1983) 247–253. R. Pincak, Physica A, 392 (2013) 6414–6426. The online trade system provided by Librade LTD, http://www.librade.org/our-solution/development/. OANDA data for EUR/USD currency rates from live account, http://www.oanda.com/. M. Dacorogna et al., “An Introduction to High-Frequency Finance”, Academic Press (2001), 383 p. G. Appel, “Technical Analysis: Power Tools for Active Investors”, FT Press (2005), 264. G. Box, G. Jenkins and G. Reinsel “Time-Series Analysis: Forecasting and Control”, 4th edition, Wiley (2008), 784 p. G. Weisang, Y. Awazu, Case Studies in Business, Industry and Government Statistics, Vol. 2, Issue 1 (2008). M. Bundzel, T. Kasanicky and R. Pincak, Computing and Informatics, Vol. 32, (2013), 1001–1015. D. Horvath and R. Pincak, Physica A, 391 (2012) 5172–5188. W. H. Wagner, M. Edwards, Financial Analysts Journal 49 (1993) 65. R. Roll, The Journal of Finance 39 (1984) 1127. T. Bisig, A. Dupuis, V. Impagliazzo, R. B. Olsen, Quantitative Finance, Vol. 12 (2012) 501–508. W. Greiner, “Relativistic quantum mechanics: Wave equations”, Berlin, Germany: Springer (1990) 345 p.

Log In

With string model to time series forecasting

Sign up for access to the world's latest research

Abstract

Figures (15)

Related papers