Problem in defining multidimensional array matrix and regression

shalu.ashu50 at gmail.com shalu.ashu50 at gmail.com
Sun Nov 19 10:31:28 EST 2017


Hi, All,

I have 6 variables in CSV file. One is rainfall (dependent, at y-axis) and others are predictors (at x). I want to do multiple regression and create a correlation matrix between rainfall (y) and predictors (x; n1=5). Thus I want to read rainfall as a separate variable and others in separate columns, so I can apply the algo. However, I am not able to make a proper matrix for them. 

Here are my data and codes?
Please suggest me for the same.
I am new to Python.

RF	P1	P2	P3	P4	P5
120.235	0.234	-0.012	0.145	21.023	0.233
200.14	0.512	-0.021	0.214	22.21	0.332
185.362	0.147	-0.32	0.136	24.65	0.423
201.895	0.002	-0.12	0.217	30.25	0.325
165.235	0.256	0.001	0.22	31.245	0.552
198.236	0.012	-0.362	0.215	32.25	0.333
350.263	0.98	-0.85	0.321	38.412	0.411
145.25	0.046	-0.36	0.147	39.256	0.872
198.654	0.65	-0.45	0.224	40.235	0.652
245.214	0.47	-0.325	0.311	26.356	0.632
214.02	0.18	-0.012	0.242	22.01	0.745
147.256	0.652	-0.785	0.311	18.256	0.924

import numpy as np
import statsmodels as sm
import statsmodels.formula as smf
import csv

with open("pcp1.csv", "r") as csvfile:
    readCSV=csv.reader(csvfile)
    
    rainfall = []
    csvFileList = [] 
    
    for row in readCSV:
        Rain = row[0]
        rainfall.append(Rain)

        if len (row) !=0:
            csvFileList = csvFileList + [row]       
        
print(csvFileList)
print(rainfall)

Please suggest me guys
Thanks




More information about the Python-list mailing list