I think the target math is some form of multiple regression, like a linear regression but fitting a line to n-dimensions instead of only 2 dimensions. Hoping there may be a name for it, or a name for a better process.
case:
- time series, in days
- an event occurs or not for a given day, so gets a value of 0 or 1
- several other variables (indicator values) that can have any value, which are measured in the early part of the day
goal:
- which of the other variables show some relationship to the event occurring or not?
- is there a formula to get a prediction value?
If the case were one variable predicting the event, I could do something simple, like the variable is higher or lower than a level. In my case, I've currently got 3 variables, and may add more. It seems too simple to say higher or lower for each individually, which is what I can do with backtesting software. Is there math, a stat test, or a process to figure out which variables are important, or if there is some effect from combining them?
I have Eureqa, but usually I don't have much luck when the outcome is 0 or 1. I also have Octave, which may have better functions for this type of task.
thanks in advance -- stat test names or wikipedia articles to get me started are very helpful, and sorry if my description does not make much sense, my statistics knowledge is not very deep, I'll try to study more
case:
- time series, in days
- an event occurs or not for a given day, so gets a value of 0 or 1
- several other variables (indicator values) that can have any value, which are measured in the early part of the day
goal:
- which of the other variables show some relationship to the event occurring or not?
- is there a formula to get a prediction value?
If the case were one variable predicting the event, I could do something simple, like the variable is higher or lower than a level. In my case, I've currently got 3 variables, and may add more. It seems too simple to say higher or lower for each individually, which is what I can do with backtesting software. Is there math, a stat test, or a process to figure out which variables are important, or if there is some effect from combining them?
I have Eureqa, but usually I don't have much luck when the outcome is 0 or 1. I also have Octave, which may have better functions for this type of task.
thanks in advance -- stat test names or wikipedia articles to get me started are very helpful, and sorry if my description does not make much sense, my statistics knowledge is not very deep, I'll try to study more
