System Robustness

nizar · 23 August 2007

howardbandy said:
Greetings --

I have written one book about trading system design, testing, and validation -- Quantitative Trading Systems -- and have another planned -- Trading System Validation. I've read through the thread to date, and I'd like to add my two cents worth to the discussion of robustness.

1. In order to measure robustness, there must be a metric, often called an objective function. The metric is personal -- yours will probably be different than mine -- and it must be designed or chosen before the model development process begins in earnest. Some people use compound annual rate of return, others have quite complex objective functions that include terms for trading frequency, annual rate of return, drawdown, holding period, and so forth. The sensitivity of parameters to specific data can be addressed by perturbing the parameter values, measuring the associated objective value, and averaging. Sensitivity to outliers can be addressed by limiting the profit or loss associated with them. These, and other Monte Carlo techniques can be incorporated into the objective function.

There is a very important point about the objective function -- if two trading systems or alternatives are compared over a given set of data, the one that scores highest according to the objective function must be the one that the person developing the system prefers. If that is not the case, then there must be some factor that has not yet been included in the objective function. That factor must be identified, quantified, and incorporated into the objective function before proceeding. Further along in the trading system development process, we will not be able to see all the alternatives -- only the one alternative that scores highest will be used.

2. The data being used must be divided into at least two data sets -- an in-sample data set that is used to select the parameters for the trading system, and an out-of-sample data set that is used very infrequently -- preferably exactly one time -- to measure the performance of the system on data that has never been seen before. If the trading system is adjusted based on the results of the out-of-sample data, then that previously out-of-sample data has just become in-sample data, and a new out-of-sample data set must be used for validation.

3. Search through the in-sample data set as much as you wish. Look for values of the parameters that maximize the value of the objective function; add rules and filters. When you are satisfied with the in-sample results, test on the out-of-sample data. If the out-of-sample results are satisfactory, you have an indication that the system might be robust and might be tradable.

4. To increase your level of confidence, perform the walk-forward test. That is, select a length of time to use as the in-sample length and a length of time to use as the out-of-sample length. The only arrangement of the two periods that is practical is to have the out-of-sample immediately follow the in-sample. Start far enough ago that there is room for several out-of-sample periods before today. Beginning with the first in-sample period (the oldest in-sample period), find the optimum parameters -- optimum is always determined by the highest value of the objective function -- then test over the associated out-of-sample period, and record the results. Move on to the second in-sample period, optimize, test the second out-of-sample period, and record the results. After all the in-sample periods have been processed, concatenate the results from the out-of-sample periods. If those results are satisfactory, your confidence is increased.

5. Contrary to popular belief, there is no statistic or other measurement that can be taken from the in-sample results that give any indication of the likelihood of a system to be profitable in the future. Thirty closed trades is not enough -- 30,000 closed in-sample trades is not enough. The results achieved over the in-sample period are meaningless.

6. I'll say that again -- the result achieved over the in-sample period are meaningless. They are always good. We do not stop searching until they are good.

7. There is no guarantee that any trading system will be profitable in the future. The best we can hope for is a high degree of confidence. The only way to build that confidence is by repeating the optimization - out-of-sample testing process.

8. If you do not have acceptable test results on a truly out-of-sample data period, then out-of-sample testing begins tomorrow with real money.

Thanks for listening,
Howard
www.quantitativetradingsystems.com

Top post there, the bold and underlined sections is just what i thought was key.

Please Howard, visit us more often to share your wisdom

I think I may have to get my hands on your book.

R0n1n · 23 August 2007

howardbandy said:
2. The data being used must be divided into at least two data sets -- an in-sample data set that is used to select the parameters for the trading system, and an out-of-sample data set that is used very infrequently -- preferably exactly one time -- to measure the performance of the system on data that has never been seen before. If the trading system is adjusted based on the results of the out-of-sample data, then that previously out-of-sample data has just become in-sample data, and a new out-of-sample data set must be used for validation.
www.quantitativetradingsystems.com

So can I use the ASX Historical data as an in-sample data for system development and the US Historical data for performance measurement ? Or the vice-versa ??

nizar · 23 August 2007

R0n1n said:
So can I use the ASX Historical data as an in-sample data for system development and the US Historical data for performance measurement ? Or the vice-versa ??

In my view.
Yes.

Another way is to do all your fine tuning and testing from, say, the period 1992-1999, and then once you are happy with results, you forward test it through 1999-2007.

I personally intend to do both.

ie. I will test my system across different timeframes of the same market AND also across different markets (US stocks and other otherseas bourses).

bingk6 · 23 August 2007

R0n1n said:
So can I use the ASX Historical data as an in-sample data for system development and the US Historical data for performance measurement ? Or the vice-versa ??

Or alternatively, segregate the ASX stocks, eg in sample may be ASX100, and out of sample may be ASX200 exclude ASX100 or ASX300 exclude ASX200 etc etc.

Howard, top post. I am curious as to the period one should allocate for the insample testing. Some have commented that you should have bearish, flatish and bullish conditions all within the insample period, which could potentially make it a fairly long period. I am interested in your views on the minimum period one should alocate for insample testing.

nizar · 23 August 2007

bingk6 said:
I am interested in your views on the minimum period one should alocate for insample testing.

Me too.
I dont think there is a minimum period.
It probably depends on how longterm you wish to make your system.

But im keen to hear Howard's thoughts on this matter.

howardbandy · 23 August 2007

Greetings all --

Thanks for the kind words and civilized response -- my views are upsetting to some people.

----

On the question of how to select the in-sample data and how to select the out-of-sample data.

If you are working with a single issue, say a major stock index or a specific commodity, then all of the in-sample data will be coming from one ticker and the question becomes "what periods of time should be used?" The short answer is to use whatever periods of time give good results. My feeling is that every market we model is dynamic -- non-stationary. The characteristics of the market change over time. The most obvious changes are easily measured -- the slope of a moving average, the average true range, the number of overnight gaps, as so forth. Any single version of the trading system we build to model that market is static for the entire period we use it. As long as the underlying market does not "move" too much, the model continues to accurately represent it, and the buy and sell signals continue to be profitable. Eventually the characteristics of the market change and the model is out-of-synch with the market. Maybe the market will return to its earlier state, but usually it will not, and a new model is needed, so we must reoptimize. That is, we must perform the next walk-forward iteration.

The length of time that the model and the market stay in synch determines the maximum length of the out-of-sample period. While there is no theoretical need that all out-of-sample periods be the same length, it is common for them to be. The algorithms to perform the walk-forward testing automatically are easiest to implement when reoptimization is done after a set number of bars. The most often reoptimization can take place is after every bar, and that is acceptable, although computationally intensive. The least often reoptimization can take place is never, and that is acceptable as long as the model and the market stay sufficiently in synch that the trades are profitable.

Which brings up another issue I'll address here briefly, but put off for a later posting. "Will a trading system that once worked, but is now broken, ever return to profitability?" My answer is that all trading system eventually break and that broken systems very seldom return to profitability. Trading systems are unique as models go (in the sense of statistical models of physical processes), in that the act of modeling changes the process being modeled -- every profitable trade made removes some of the inefficiency that the model has identified. Every profitable trade actually made (paper trades do not count) makes it less likely that the next trade will be profitable. For example, the Donchian-style breakout systems that worked so well in the 1970s and 80s no longer work, and will probably never work again.

Back to selecting the length of the in-sample period. There must be a large enough number of data points such that the model can detect some general feature of the market. The relationship between the number of data points and the number of parameters in the model is very similar to the relationship between the number of data points and the degree of a polynomial being fit to them. As the degree of the polynomial increases, the goodness of the fit increases, but the accuracy of the prediction of the next data point does not necessarily increase. The model becomes curve-fit to the data. In some models that is desirable -- a model of the rotation of two stars around a common center of gravity. But in trading systems, there is a peak in the fitness to the in-sample data where the general features of the market have been identified and learned, but the noise has not yet been learned. To deal with this, some model development platforms use three data periods -- the first is the in-sample period used to pick the best parameter values, the second is another in-sample period used to guide the modeling process and determine when the fit has found its peak and stop, and the third is the out-of-sample period used to validate the model.

What is the practical solution? For an end-of-day trading system, try out some different lengths for the in-sample period -- say two years, one year, six months, three months. The best length will depend on the complexity of the model and the stability of that portion of the underlying market that your model is trying to match. What length for the out-of-sample period? Any length shorter than the amount of time it usually takes for the market to shift away from the model. If a model is going to be accurate and profitable, it should immediately be accurate and profitable. So try out-of-sample periods of one month, one week, one day.

I can hear someone saying "But wait, trying all these combinations and then picking the lengths is using results from out-of-sample testing to determine the model." And they are correct. But only the length of the in-sample period is being chosen by optimization, so only four data points are being tested -- two years, one year, six months, three months. But at that, there is still a contamination of the purity of the out-of-sampleness. So be careful.

----

Which leads directly to one of the other questions posted -- can a trading system be developed using one ticker and validated using other tickers? That is, can the in-sample data be one stock and the out-of-sample data be another stock? Yes, but there is a caution here as well.

First, my opinion is that there is no requirement that a trading system must work on every data series in order for it to be considered robust -- that will just never happen. For whatever fundamental reasons prices vary, I have no reason to expect that the share prices of a gold exploration company, an automobile manufacturer, a bank, a food producer, and a soft drink company all act the same. But I should see similarities within sectors -- if a model works well for one insurance company, it should work reasonably well for most other insurance companies, and maybe even for banks.

I can use one company as in-sample data to build a model, then other similar companies as out-of-sample data to validate it.

But avoid what I call "optimizing the ticker space." It is not valid to build a trading model using the data series from one ticker, then test it over, say, 500 different tickers and trade only those 10 of the 500 that were profitable. That is just curve-fitting. The 10 are probably profitable because they got lucky. If 400 were profitable, there is hope.

There is a technique that helps determine which tickers to trade. Develop a model using one ticker over one in-sample period of time, but reserve two out-of-sample periods of time. Test the model on the universe of say 500 tickers for the first out-of-sample period and count the proportion of the 500 that are profitable. Make a separate list of those that are profitable, and test them over the second out-of-sample period. If the proportion that are profitable in the second out-of-sample period is about the same as the proportion that were profitable in the first in-sample period, then the model is not robust. If the proportion that is profitable in the second out-of-sample period is high, then the model is probably robust.

----

Let me clarify one point. Throughout this posting I have been using the term "is profitable." That is shorthand for saying "score a high value using the objective function." All measurements of the merit of a trading system are made by computing the score for the objective function -- not just the profitability.

----

This has become longer, and a little more theoretical, than I intended. I hope it is useful.

Thanks for listening,
Howard
www.quantitativetradingsystems.com

R0n1n · 24 August 2007

howardbandy said:
my views are upsetting to some people.

Why would your view upset someone when they make sense is beyond me.....

tech/a · 24 August 2007

R0n1n said:
Why would your view upset someone when they make sense is beyond me.....

Its common.
Known as the tall Poppy syndrome.
Ive been at the end of it myself
And accused (sometimes rightly) of dishing it out.

I have some questions for Howard but wish to have the time to ask them in full.Perhaps tommorow.

R0n1n · 24 August 2007

ok here is another attempt at getting my system off the ground. The testing universe is ASX200. It was developed on the Nasdaq 100. ( as howard bandy mentioned) . I have pasted the backtest reports from Tradesim and Amibroker. Time Period was 1 year.

Please feel free to comment, good and bad all welcome

howardbandy · 24 August 2007

Greetings --

The first question to ask is always "Are these results from in-sample or out-of-sample periods?"

If the answer is in-sample, then ask to see the out-of-sample results. The in-sample results give absolutely no indication of the likely profitability of trading the system.

If the answer is out-of-sample, then ask to see the equity curve from the concatenated out-of-sample runs. If the equity curve looks like the kind that you would be comfortable to trade, then learn more about the system.

Thanks,
Howard
www.quantitativetradingsystems.com

nizar · 24 August 2007

Well this is (the workings of) my system tested on all ASX stocks from 1992-2002.

Still got alot of work to do to mininise that drawdown

Monte Carlo Report

Trade Database Filename
C:\TradeSimData\Version A.trb

Simulation Summary
Simulation Date: 21/08/2007
Simulation Time: 2:47:32 PM
Simulation Duration: 128.27 seconds

Trade Parameters
Initial Capital: $30,000.00
Portfolio Limit: 100.00%
Maximum number of open positions: 100
Position Size Model: Fixed Percent Risk
Percentage of capital risked per trade: 1.00%
Position size limit: 100.00%
Portfolio Heat: 100.00%
Pyramid profits: Yes
Transaction cost (Trade Entry): $44.00
Transaction cost (Trade Exit): $44.00
Margin Requirement: 100.00%
Magnify Position Size(& Risk) according to Margin Req: No
Margin Requirement Daily Interest Rate (Long Trades): 0.0000%
Margin Requirement Yearly Interest Rate (Long Trades): 0.0000%
Margin Requirement Daily Interest Rate (Short Trades): 0.0000%
Margin Requirement Yearly Interest Rate (Short Trades): 0.0000%

Trade Preferences
Trading Instrument: Stocks
Break Even Trades: Process separately
Trade Position Type: Process all trades
Entry Order Type: Default Order
Exit Order Type: Default Order
Minimum Trade Size: $500.00
Accept Partial Trades: No
Volume Filter: Ignore Volume Information
Pyramid Trades: Yes
Favour Trade Pyramid: Yes
Start Pyramid at any level up to level: N/A
Maximum Pyramid Level Limited to: N/A
Maximum Pyramid Count Limited to: N/A

Simulation Stats
Number of trade simulations: 5000
Trades processed per simulation: 19705
Maximum Number of Trades Executed: 828
Average Number of Trades Executed: 703
Minimum Number of Trades Executed: 580
Standard Deviation: 30.04

Profit Stats
Maximum Profit: $54,269,692.88 (180898.98%)
Average Profit: $5,030,204.63 (16767.35%)
Minimum Profit: $184,202.92 (614.01%)
Standard Deviation: $4,298,807.36 (14329.36%)
Probability of Profit: 100.00%
Probability of Loss: 0.00%

Percent Winning Trade Stats
Maximum percentage of winning trades: 39.91%
Average percentage of winning trades: 34.95%
Minimum percentage of winning trades: 28.93%
Standard Deviation: 1.56%

Percent Losing Trade Stats
Maximum percentage of losing trades: 71.07%
Average percentage of losing Trades: 65.05%
Minimum percentage of losing trades: 60.09%
Standard Deviation: 1.56%

Average Relative Dollar Drawdown Stats
Maximum of the Average Relative Dollar Drawdown: $194,188.18
Average of the Average Relative Dollar Drawdown: $21,420.74
Minimum of the Average Relative Dollar Drawdown: $1,795.50
Standard Deviation: $15,379.53

Average Relative Percent Drawdown Stats
Maximum of the Average Relative Percent Drawdown: 3.1828%
Average of the Average Relative Percent Drawdown: 2.0099%
Minimum of the Average Relative Percent Drawdown: 1.3763%
Standard Deviation: 0.2245%

Maximum Peak-to-Valley Dollar Drawdown Stats
Maximum Absolute Dollar Drawdown: $9,514,549.86
Average Absolute Dollar Drawdown: $731,030.55
Minimum Absolute Dollar Drawdown: $37,114.43
Standard Deviation: $637,463.24

Maximum Peak-to-Valley Percent Drawdown Stats
Maximum Absolute Percent Drawdown: 63.8287%
Average Absolute Percent Drawdown: 32.9436%
Minimum Absolute Percent Drawdown: 14.5495%
Standard Deviation: 7.2958%

nizar · 24 August 2007

And the same system tested over 1997-2007 again with the universe being all ASX stocks.

The drawdown is disgusting

Monte Carlo Report

Trade Database Filename
C:\TradeSimData\version A2.trb

Simulation Summary
Simulation Date: 24/08/2007
Simulation Time: 10:23:59 PM
Simulation Duration: 400.66 seconds

Trade Parameters
Initial Capital: $30,000.00
Portfolio Limit: 100.00%
Maximum number of open positions: 100
Position Size Model: Fixed Percent Risk
Percentage of capital risked per trade: 1.00%
Position size limit: 100.00%
Portfolio Heat: 100.00%
Pyramid profits: Yes
Transaction cost (Trade Entry): $44.00
Transaction cost (Trade Exit): $44.00
Margin Requirement: 100.00%
Magnify Position Size(& Risk) according to Margin Req: No
Margin Requirement Daily Interest Rate (Long Trades): 0.0000%
Margin Requirement Yearly Interest Rate (Long Trades): 0.0000%
Margin Requirement Daily Interest Rate (Short Trades): 0.0000%
Margin Requirement Yearly Interest Rate (Short Trades): 0.0000%

Trade Preferences
Trading Instrument: Stocks
Break Even Trades: Process separately
Trade Position Type: Process all trades
Entry Order Type: Default Order
Exit Order Type: Default Order
Minimum Trade Size: $500.00
Accept Partial Trades: No
Volume Filter: Ignore Volume Information
Pyramid Trades: Yes
Favour Trade Pyramid: No
Start Pyramid at any level up to level: N/A
Maximum Pyramid Level Limited to: N/A
Maximum Pyramid Count Limited to: N/A

Simulation Stats
Number of trade simulations: 10000
Trades processed per simulation: 29314
Maximum Number of Trades Executed: 964
Average Number of Trades Executed: 849
Minimum Number of Trades Executed: 366
Standard Deviation: 36.47

Profit Stats
Maximum Profit: $401,977,157.45 (1339923.86%)
Average Profit: $18,453,235.32 (61510.78%)
Minimum Profit: -$29,504.86 (-98.35%)
Standard Deviation: $21,876,167.05 (72920.56%)
Probability of Profit: 99.96%
Probability of Loss: 0.04%

Percent Winning Trade Stats
Maximum percentage of winning trades: 41.67%
Average percentage of winning trades: 35.58%
Minimum percentage of winning trades: 19.95%
Standard Deviation: 1.56%

Percent Losing Trade Stats
Maximum percentage of losing trades: 80.05%
Average percentage of losing Trades: 64.42%
Minimum percentage of losing trades: 58.33%
Standard Deviation: 1.56%

Average Relative Dollar Drawdown Stats
Maximum of the Average Relative Dollar Drawdown: $433,449.77
Average of the Average Relative Dollar Drawdown: $30,333.38
Minimum of the Average Relative Dollar Drawdown: $517.05
Standard Deviation: $31,205.21

Average Relative Percent Drawdown Stats
Maximum of the Average Relative Percent Drawdown: 7.5050%
Average of the Average Relative Percent Drawdown: 1.9909%
Minimum of the Average Relative Percent Drawdown: 1.2500%
Standard Deviation: 0.2531%

Maximum Peak-to-Valley Dollar Drawdown Stats
Maximum Absolute Dollar Drawdown: $27,645,625.94
Average Absolute Dollar Drawdown: $1,597,883.70
Minimum Absolute Dollar Drawdown: $18,822.11
Standard Deviation: $2,006,702.31

Maximum Peak-to-Valley Percent Drawdown Stats
Maximum Absolute Percent Drawdown: 98.3495%
Average Absolute Percent Drawdown: 40.1098%
Minimum Absolute Percent Drawdown: 15.1293%
Standard Deviation: 11.1830%

R0n1n · 24 August 2007

howardbandy said:
Greetings --

The first question to ask is always "Are these results from in-sample or out-of-sample periods?"

Howard the results are from out of sample data.

nizar My other system too suffers from really bad drawdown.

What can be done to bring it down. Help from anyone would be much appreciated.

howardbandy · 25 August 2007

Greetings --

Sometimes -- drawdowns can be reduced by shortening holding periods. Or, maybe the entries are too early or too late.

Try this:

Look at the length of your average winning trade and average losing trade. Say they are each 10 days. You will be using half that length in the next step. If they have different lengths, try this with both lengths.

Leave all of your buy logic in place. Comment out your sell logic.

Use this for your experimental sell logic:
Sell = BarsSince(Buy) >=5;

Run your backtests.

If your drawdown is now lower, that is an indication that your holding period is too long. Make your exits sooner.

If your drawdown has not changed much or is higher, then your entries need work.

Thanks,
Howard
www.quantitativetradingsystems.com

tech/a · 25 August 2007

howardbandy said:
Greetings --

Sometimes -- drawdowns can be reduced by shortening holding periods. Or, maybe the entries are too early or too late.

Try this:

Look at the length of your average winning trade and average losing trade. Say they are each 10 days. You will be using half that length in the next step. If they have different lengths, try this with both lengths.

Leave all of your buy logic in place. Comment out your sell logic.

Use this for your experimental sell logic:
Sell = BarsSince(Buy) >=5;

Run your backtests.

If your drawdown is now lower, that is an indication that your holding period is too long. Make your exits sooner.

If your drawdown has not changed much or is higher, then your entries need work.

Thanks,
Howard
www.quantitativetradingsystems.com

Howard.
Struggling with this test.

By cutting the exit back to 50% of the average trading period surely this alters the landscape of the system dramatically.
The reason for the exit is no longer valid and now becomes an arbitory look back period. Arbitory as it will vary dramatically lengths of trades.
Those trades which were the trades that caught the trend (Any trend) would be instantly depleated.
All trends regardless of length would be on average cut back dramatically.
All your doing is altering the exit criteria and then applying the results to a system which has a different criteria for exit.Results surely for both are independant of each other.To me its like comparing one system to another and expecting them both to tell me something about each other---??

If your drawdown has not changed much or is higher, then your entries need work.

Getting the head around this as well.
Wouldnt number of exits caused by initial stops be important for entries.
Infact the importance of an entry can quickly disappear the longer a stock is held.The original reason a stock is purchased becomes in significant and the reason you sell it becomes the most important focus.

Are we talking initial system drawdown or Peak to Valley drawdown.?

Arent initial drawdowns more dependant on entry and Peak to Valley Exit?
IE for Initial Drawdown I have found through testing that the optimum value is 7-12% of initial purchase price, regardless of how you set your exit.% or technical point.

Shorter exits tend to increase the number of stops above 20% which has an impact on staying on an emerging trend beyound "noise".
Longer exits cut down stop outs (20% will cut out around 95% in most cases) however tends to trap most stocks in to long periods of no mans land.The point between buy,stop and profit.
There becomes an opportunity cost as trades are stuck doing nothing.

Its been my experience that its at the ENTRY end of the trade where INITIAL drawdowns have an impact and virtually no impact on Peak to Valley D/D
and at the EXIT end where Peak to Valley drawdowns are most affected and initial drawdown virtually un affected.

Interested in your views.

nizar · 25 August 2007

tech/a said:
Its been my experience that its at the ENTRY end of the trade where INITIAL drawdowns have an impact and virtually no impact on Peak to Valley D/D
and at the EXIT end where Peak to Valley drawdowns are most affected and initial drawdown virtually un affected.

Hmmm yeh i thought the same thing as well.
Does tradesim tell you initial drawdown on its own?

tech/a · 26 August 2007

howardbandy said:
Greetings --

I have written one book about trading system design, testing, and validation -- Quantitative Trading Systems -- and have another planned -- Trading System Validation. I've read through the thread to date, and I'd like to add my two cents worth to the discussion of robustness.

1. In order to measure robustness, there must be a metric, often called an objective function. The metric is personal -- yours will probably be different than mine -- and it must be designed or chosen before the model development process begins in earnest. Some people use compound annual rate of return, others have quite complex objective functions that include terms for trading frequency, annual rate of return, drawdown, holding period, and so forth. The sensitivity of parameters to specific data can be addressed by perturbing the parameter values, measuring the associated objective value, and averaging. Sensitivity to outliers can be addressed by limiting the profit or loss associated with them. These, and other Monte Carlo techniques can be incorporated into the objective function.

There is a very important point about the objective function -- if two trading systems or alternatives are compared over a given set of data, the one that scores highest according to the objective function must be the one that the person developing the system prefers. If that is not the case, then there must be some factor that has not yet been included in the objective function. That factor must be identified, quantified, and incorporated into the objective function before proceeding. Further along in the trading system development process, we will not be able to see all the alternatives -- only the one alternative that scores highest will be used.

Although I have done this myself Ive never looked at it as sussinctly as you have put it here.

Excellent

2. The data being used must be divided into at least two data sets -- an in-sample data set that is used to select the parameters for the trading system, and an out-of-sample data set that is used very infrequently -- preferably exactly one time -- to measure the performance of the system on data that has never been seen before. If the trading system is adjusted based on the results of the out-of-sample data, then that previously out-of-sample data has just become in-sample data, and a new out-of-sample data set must be used for validation

3. Search through the in-sample data set as much as you wish. Look for values of the parameters that maximize the value of the objective function; add rules and filters. When you are satisfied with the in-sample results, test on the out-of-sample data. If the out-of-sample results are satisfactory, you have an indication that the system might be robust and might be tradable.

Could this simply be data from another bourse or commodity?

4. To increase your level of confidence, perform the walk-forward test. That is, select a length of time to use as the in-sample length and a length of time to use as the out-of-sample length. The only arrangement of the two periods that is practical is to have the out-of-sample immediately follow the in-sample. Start far enough ago that there is room for several out-of-sample periods before today. Beginning with the first in-sample period (the oldest in-sample period), find the optimum parameters -- optimum is always determined by the highest value of the objective function -- then test over the associated out-of-sample period, and record the results. Move on to the second in-sample period, optimize, test the second out-of-sample period, and record the results. After all the in-sample periods have been processed, concatenate the results from the out-of-sample periods. If those results are satisfactory, your confidence is increased.

Objective function
Highest value of the objective function do you simply mean those conditions which you see as your main objectives being met? Can you elaborate on the "In Black quote?"

5. Contrary to popular belief, there is no statistic or other measurement that can be taken from the in-sample results that give any indication of the likelihood of a system to be profitable in the future. Thirty closed trades is not enough -- 30,000 closed in-sample trades is not enough. The results achieved over the in-sample period are meaningless.

6. I'll say that again -- the result achieved over the in-sample period are meaningless. They are always good. We do not stop searching until they are good.

7. There is no guarantee that any trading system will be profitable in the future. The best we can hope for is a high degree of confidence. The only way to build that confidence is by repeating the optimization - out-of-sample testing process.

I'm feeling like you mean optimisation of Parameters.
Isnt an in sample set of results and in particular a lengthy montecarlo test result supplying you with what I call a "Blueprint" which if your trading falls within the blueprints returned results will return you a profit within the parameters of profit returned in testing? Therefore confidence.

8. If you do not have acceptable test results on a truly out-of-sample data period, then out-of-sample testing begins tomorrow with real money.

Thanks for listening,
Howard
www.quantitativetradingsystems.com

Think I'll become a client as well.
I dont like using Cards over the Nett---been done once---us there another way?Cheque--phone a card number through?

R0n1n · 26 August 2007

optimisation of Parameters

Is it better to optimise paramaters of indicators of a system seperately or do the whole optimisation in one go. The reason I ask is doing it seperately is a bit faster and I can do it over various times while doing the whole system takes considerable time.

tech/a · 26 August 2007

Ron.

I maybe wrong but I dont think Howard is talking about the optimisation of Variables.
Optimisation of parameters I am assuming are the components of Drawdown,String of Losses,Average time held Profit to loss etc,whatever those parameters are that are important to you in your systems developement.

But to your question.
Which comes with another question.
How do you avoid Curve fitting?
What makes you believe optimisation of variables will increase profitability "Walking forward"?

R0n1n · 26 August 2007

tech/a said:
Ron.

I maybe wrong but I dont think Howard is talking about the optimisation of Variables.
Optimisation of parameters I am assuming are the components of Drawdown,String of Losses,Average time held Profit to loss etc,whatever those parameters are that are important to you in your systems developement.

But to your question.
Which comes with another question.
How do you avoid Curve fitting?
What makes you believe optimisation of variables will increase profitability "Walking forward"?

Tech, you just love to make my brain work hard on a sunday

to answer you, to avoid curve fitting one can test it with one market and test it out on another market but not optomise it to that market.

To answer your last question with a question

isin't optimisation of variables a method to just skew the numbers in your favor. But Curve fitting = when you skew numbers to prove your point, to prove that you have nailed the system ??

System Robustness

nizar

R0n1n

nizar

bingk6

nizar

howardbandy

R0n1n

tech/a

No Ordinary Duck

R0n1n

Attachments

howardbandy

nizar

nizar

R0n1n

howardbandy

tech/a

No Ordinary Duck

nizar

tech/a

No Ordinary Duck

R0n1n

tech/a

No Ordinary Duck

R0n1n

Similar threads

Connect with us