Delete duplicate rows in textfile - except it contains a "{" or "}"

Dave Angel d at davea.name
Wed Oct 10 09:28:10 EDT 2012


On 10/10/2012 04:51 AM, Joon Ki Choi wrote:
> Hello Pythonistas,
>
> i have a very large textfile with contents like:
>
> @INBOOK{Ackermann1999-b,
>   author = {Ackermann, K.-F. and Ackermann, K.-F. and Ackermann, K.-F. and Ackermann,
> 	K.-F. and Ackermann, K.-F. and Ackermann, K.-F. and Ackermann, K.-F.
> 	and Ackermann, K.-F. and Ackermann, K.-F. and Ackermann, K.-F. and
> 	Ackermann, K.-F. and Ackermann, K.-F. and Ackermann, K.-F. and Ackermann,
> 	K.-F. and Ackermann, K.-F. and Ackermann, K.-F. and Ackermann, K.-F.
> 	and Ackermann, K.-F. and Ackermann, K.-F. and Ackermann, K.-F. and
> 	Ackermann, K.-F. and Ackermann, K.-F. and Ackermann, K.-F. and Ackermann,
> 	K.-F. and Ackermann, K.-F. and Ackermann, K.-F. and Ackermann, K.-F.
> 	and Ackermann, K.-F. and Ackermann, K.-F. and Ackermann, K.-F. and
> 	Ackermann, K.-F. and Ackermann, K.-F. and Ackermann, K.-F. and Ackermann},
>   year = {1980},
>   timestamp = {1995-12-02}
> }	
>
> And i want to delete the duplicate rows except these rows containing the brackets { or }. 
> The result should look like:
>
> @INBOOK{Ackermann1999-b,
>   author = {Ackermann, K.-F. and Ackermann, K.-F. and Ackermann, K.-F. and Ackermann,
> 	Ackermann, K.-F. and Ackermann, K.-F. and Ackermann, K.-F. and Ackermann},
>   year = {1980},
>   timestamp = {1995-12-02}
> }

Which is it?  Do you want to match your output, or match your
description?  Your description would result in:

@INBOOK{Ackermann1999-b,
  author = {Ackermann, K.-F. and Ackermann, K.-F. and Ackermann, K.-F. and Ackermann,
	K.-F. and Ackermann, K.-F. and Ackermann, K.-F. and Ackermann, K.-F.
	and Ackermann, K.-F. and Ackermann, K.-F. and Ackermann, K.-F. and
	Ackermann, K.-F. and Ackermann, K.-F. and Ackermann, K.-F. and Ackermann,
	Ackermann, K.-F. and Ackermann, K.-F. and Ackermann, K.-F. and Ackermann},
  year = {1980},
  timestamp = {1995-12-02}
}	

(that's doing it by eyeball, so i may have missed some minor differences)



-- 

DaveA




More information about the Python-list mailing list