Find Minimum for element in multiple dimensional array

Robert Davis rdavis7408 at gmail.com
Wed Jul 22 18:54:06 EDT 2015


Given a set of arrays within an array how do I find the arrays with the minimum values based on two elements/columns in the array? Those two elements/columns are the destination zip code and distance.

I have an array of arrays that have a origin zip code, origin latitude, origin longitude, destination zip code, destination latitude, destination longitude, and miles between the two points.

I need to keep only those combinations that represent the minimum mileage between to the destination zip code. For example a point in New Jersey may have a distance from the Philadelphia Office that is 45 miles, from the Newark Office that is 78 miles and one from the Delaware Office that is 58 miles.

I need to keep the mileage from the Philadelphia Office that is 45 miles and produce a .csv file that has origin zip code, origin latitude, origin longitude, destination zip code, destination latitude, destination longitude, and miles between the two points.

The array looks like this:

[['37015', 'TN31', 36.2777, -87.0046, 'NY', 'White Plains', '10629', 41.119008, -73.732996, 77.338920003], 
['72202', 'ARB1', 34.739224, -92.27765, 'NY', 'White Plains', '10629', 41.119008, -73.732996, 1099.7837975322097]]

My code looks like this :

import csv
import math


def calculate_distance(lat1, lon1, lat2, lon2):

    if (not lat1) or (not lon1) or (not lat2) or (not lon2):
            return -1

    lat1 = float(lat1) * math.pi/180
    lon1 = float(lon1) * math.pi/180
    lat2 = float(lat2) * math.pi/180
    lon2 = float(lon2) * math.pi/180

    return 3959.0 * math.acos(math.sin(lat1) * math.sin(lat2) +   math.cos(lat1) * math.cos(lat2) * math.cos(lon2-lon1))

#Above function changed from the following URL: http://iamtgc.com/geocoding- with-python/


InputPath = "C:\\Users\\jacobs\\Downloads\\ZipCodes\\"

ZipCodes = "zipcode.csv"
RptgOfficeFile = "Reporting_Office_2015072001.csv"
InputFile = InputPath+RptgOfficeFile
zInputFile = InputPath+ZipCodes
zOutputFile = InputPath+'Zip_Code_Distance.csv'
z1OutputFile = InputPath+'Minimum_Distance_Zip_Code_File.csv'


f = open(InputFile, 'r')

zO = open(zOutputFile,'w')
z1 = open(z1OutputFile,'w')

lines = [ ]
OfficeZipcodes = []
ZipRptOffice = {}
OLatitude = [ ]
OLongitude = [ ]
OLocationCode = []
dzip = []
dLatitude = []
dLongitude = []
dCity = []
dState = []
Combined =[]
Answers = []

for line in f:
  l = [i.strip() for i in line.split(',')]
  OfficeZipcodes.append(l[4])
  ZipRptOffice[l[4]]= l[3]
  OLatitude.append(l[5])
  OLongitude.append(l[6])
  OLocationCode.append(l[3])

del OfficeZipcodes[0]
del OLatitude[0] 
del OLongitude[0]
del OLocationCode[0]


zf = csv.DictReader(open(zInputFile))
#http://courses.cs.washington.edu/courses/cse140/13wi/csv-parsing.html

for row in zf:
    dzip.append(row["zip"])
    dLatitude.append(float(row["latitude"]))
    dLongitude.append(float(row["longitude"]))
    dCity.append(row["city"])
    dState.append(row["state"])


for i in range(len(OfficeZipcodes)):
    for j in range(len(dzip)):
        Distance = calculate_distance(OLatitude[i], OLongitude[i],dLatitude[j],dLongitude[j])
        Combined.append([OfficeZipcodes[i], OLocationCode[i],float(OLatitude[i]),float(OLongitude[i]),dState[j],dCity[j],dzip[j], dLatitude[j],dLongitude[j],Distance])
for i in range(len(Combined)):
  zO.write(str(Combined[i][0])+","+str(Combined[i][1])+","+str(Combined[i][2])+","+ str(Combined[i][3])+","+str(Combined[i][4])+","+ str(Combined[i][5])+","+ str(Combined[i][6])+","+str(Combined[i][7])+","+ str(Combined[i][8])+","+str(Combined[i][9])+"\n")

zO.close()
f.close()

I am using Python 2.7 on a Windows 7 machine.

Please help me get my head around how to accomplish this task.

Thank you very much.

Robert Davis



More information about the Python-list mailing list