Polynomial Regression analysis

Analysis title: Polynomial Regression analysis
Provider: Institute of Systems Biology
Class: PolynomialRegressionAnalysis
Plugin: ru.biosoft.analysis (Common methods of data analysis plug-in)

Polynomial regression analysis.

Regression analysis is performed for each row in experimental data independently. Consider:

Y = {Y₁...Y_m} — gene expression values.
X = {X₁...X_m} — corresponding time poins.

Value Y_i is measured at the time point X_i. Analysis constructs polynomial regression:

For each estimated regression coefficient, the P-value will be calculated, but P-value threshold will be applied only on the last coefficient (with largest power).

Parameters:

Experiment - experimental data for analysis.
- Table - a table data collection stored in the BioUML repository.
- Columns - the columns from the table which should be taken into account for futher analysis. Note that in order to ensure correct analysis you should specify the corresponding time point for each column. Time points also should ascend!
Regression power - the positive value representing power to construct regression.
P-value threshold - thresold for P-value (only elements with lower P-value will be included in the result table).
Outline boundaries - lower and upper boundaries for values from the input table. Outliers will be ignored.
Calculate FDR - the test method for calculation of False Discovery Rate (FDR) - an average rate of mistakenly builded regressions with the given P-value threshold. It randomly permutates the data 50 times and applies regression analysis to each randomized test. FDR is calculated according to the formula:
Output table - the path in BioUML repository where the result table will be stored. If a table with the specified path already exists it will be replaced. The table will contain the sum of square errors, coefficients with their scores (log10(P-value)) and graphics for original and approximated profiles.