statsmodels.sandbox.regression.try_ols_anova.form2design¶
- statsmodels.sandbox.regression.try_ols_anova.form2design(ss, data)[source]¶
 convert string formula to data dictionary
- ssstr
 I : add constant
varname : for simple varnames data is used as is
F:varname : create dummy variables for factor varname
P:varname1*varname2 : create product dummy variables for varnames
G:varname1*varname2 : create product between factor and continuous variable
- datadict or structured array
 data set, access of variables by name as in dictionaries
- Returns:¶
 - vars
dictionary dictionary of variables with converted dummy variables
- names
list list of names, product (P:) and grouped continuous variables (G:) have name by joining individual names sorted according to input
- vars
 
Notes
with sorted dict, separate name list would not be necessary
Examples
>>> xx, n = form2design('I a F:b P:c*d G:c*f', testdata) >>> xx.keys() ['a', 'b', 'const', 'cf', 'cd'] >>> n ['const', 'a', 'b', 'cd', 'cf']
  
    
      Last update:
      Jun 10, 2024