From john.bucknall at castrol.com Mon Mar 11 07:37:31 2024
From: john.bucknall at castrol.com (Bucknall, John)
Date: Mon, 11 Mar 2024 11:37:31 +0000
Subject: [Pandas-dev] Relatively simple process for multi column Fuzzy Merge > possible enhancement to merge_asof??
Message-ID:

Hi,

I have a suggestion for a relatively simple addition to the pandas merge tools. I recently had to code up a join on some columns with a fuzzy merge of multiple columns in pandas. I noted on a few Stack Overflow threads that a multi-column fuzzy merge was lacking, so is this of interest, and who should I talk to who knows how to build it as a pandas function?

It took me about nine attempts to get there, but it seems to work well for my application.

Kind regards,

John Bucknall CEng
Marketing Effectiveness Data Scientist
Castrol Global Marketing
Mobile: +44 (0)7880 926 331

Without a breadth of data you can't have lateral thinking

BP International Limited. Registered office: Chertsey Road, Sunbury on Thames, Middlesex, TW16 7BP. Registered in England and Wales, number 542515.

From garcia.marc at gmail.com Mon Mar 11 10:30:13 2024
From: garcia.marc at gmail.com (Marc Garcia)
Date: Mon, 11 Mar 2024 15:30:13 +0100
Subject: [Pandas-dev] Relatively simple process for multi column Fuzzy Merge > possible enhancement to merge_asof??
In-Reply-To:
References:
Message-ID:

I think it's probably better to suggest this as an enhancement in a GitHub issue than here. I personally don't think it's likely to be accepted, as pandas is already too big and this seems tricky to maintain.
But it can surely be implemented as a third-party package, which makes little difference to users compared with having it in pandas itself. You can find some documentation about this here:
https://pandas.pydata.org/pandas-docs/stable/development/extending.html

On Mon, Mar 11, 2024 at 1:31 PM Bucknall, John via Pandas-dev <pandas-dev at python.org> wrote:

> Hi,
>
> I have a suggestion for a relatively simple addition to the pandas merge
> tools. I recently had to code up a join on some columns with a *fuzzy merge
> of multiple columns* in pandas. I noted on a few Stack Overflow threads
> that a multi-column fuzzy merge was lacking, so is this of interest, and
> who should I talk to who knows how to build it as a pandas function?
>
> It took me about nine attempts to get there, but it seems to work well
> for my application.

From diogojarodrigues at icloud.com Mon Mar 25 09:11:47 2024
From: diogojarodrigues at icloud.com (diogojarodrigues at icloud.com)
Date: Mon, 25 Mar 2024 13:11:47 +0000
Subject: [Pandas-dev] Possible bug fix
Message-ID: <48BC5128-07F8-4CDB-9851-E1B838C08A67@icloud.com>

Hi everyone!
For the past few weeks I have been trying to fix this issue. From what I could understand, the error is raised in the function "pandas_dtype" in the file common.py. The error occurs when np.dtype is called with a pyarrow list as its argument, because NumPy doesn't recognize it. To solve this, I added a simple if statement like this: [image not preserved in the archive]

Do you think this is a good solution? I also have a script to run the code. I will attach two images with the script and the output.

Thanks for your thoughts,
Diogo Rodrigues

-------------- next part --------------
A non-text attachment was scrubbed...
Name: Captura de ecrã 2024-03-25, às 13.08.11.png
Type: image/png
Size: 161736 bytes
Desc: not available
-------------- next part --------------
A non-text attachment was scrubbed...
Name: Captura de ecrã 2024-03-25, às 13.10.13.png
Type: image/png
Size: 198660 bytes
Desc: not available
-------------- next part --------------
A non-text attachment was scrubbed...
Name: Captura de ecrã 2024-03-25, às 13.10.42.png
Type: image/png
Size: 115562 bytes
Desc: not available
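
The screenshots with the proposed change did not survive in the archive, so the following is only a rough sketch of the failure mode described in the message above, not the author's patch. It assumes that np.dtype raises TypeError for a type description it does not know, and it uses the string "list<item: int64>[pyarrow]" as a stand-in for a pyarrow list type so that pyarrow itself is not needed. The helper name numpy_dtype_or_none is hypothetical and is not part of pandas or NumPy.

```python
import numpy as np


def numpy_dtype_or_none(spec):
    """Return np.dtype(spec), or None when NumPy cannot interpret spec.

    Hypothetical helper for illustration only; it is not pandas code.
    """
    try:
        return np.dtype(spec)
    except TypeError:
        # NumPy raises TypeError for type descriptions it does not know,
        # which is the situation described for pyarrow list types.
        return None


# Assumed stand-in for a pyarrow list type (no pyarrow import required).
arrow_list_spec = "list<item: int64>[pyarrow]"

known = numpy_dtype_or_none("int64")
unknown = numpy_dtype_or_none(arrow_list_spec)

print("int64 ->", known)
print(arrow_list_spec, "->", unknown)
```

A guard of this kind lets the caller decide what to do with types NumPy does not understand instead of letting the TypeError propagate. Whether that matches the if statement proposed in the message cannot be confirmed from the archived text.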