Hello ,
if at least two (or more persons) are working (simultaneously) on an online project -like CCWiki- duplicate texts (with diff-files) are existing.
What is in your opinion an appropriate tool to delete such duplicate files automatically?
http://www.pcworld.com/article/2039794/ ... files.html
http://www.techrepublic.com/blog/linux- ... matically/
http://help.eclipse.org/luna/index.jsp? ... ks-68b.htm
https://sourceforge.net/projects/winmerge/
https://www.lifewire.com/compare-two-te ... ux-3861434
Best wishes,
Norbert
CCWiki: diff-files
Moderators: hgm, Rebel, chrisw
-
- Posts: 1643
- Joined: Tue May 20, 2008 4:57 pm
- Location: Augsburg - Germany
-
- Posts: 397
- Joined: Sun Oct 29, 2006 4:38 am
- Location: Schenectady, NY
Re: CCWiki: diff-files
First of all, the auto-generated diff files at the Computer-Chess Wiki are all different from each other: there are NO duplicated files.
Secondly, the files are NOT caused by editing the same file simultaneously. In case of that situation, the last person to save becomes the current version. The wiki software uses a locking mechanism that mitigates this danger, but if your lock times out, there is a danger of stepping on each others edits.
Thirdly, they are not diff files at all, but contain the entire wikified text, but in compressed format. Each 'diff file' (compressed text file) contains the entire text of the current page. Every save creates a new file containing the entire page, but compressed.
My concern with 'diff files' at the CC-Wiki are that too many are generated when an editor, such as yourself, make many saves as you edit instead of just saving once when you are done editing. Although this can be dangerous in terms of losing work, it is the best way to work with the wiki software. As all of these so-called 'diff files' pile up, I run short on storage space. Eventually I need to spend several hours deleting old files, which are hard to find because they are scattered through many unrelated directories, and have unhelpful naming. I should write a tool for this activity, but have very little free time to do it.
I've written you with information similar to this message in private email. Have you forgotten?
Ron
Secondly, the files are NOT caused by editing the same file simultaneously. In case of that situation, the last person to save becomes the current version. The wiki software uses a locking mechanism that mitigates this danger, but if your lock times out, there is a danger of stepping on each others edits.
Thirdly, they are not diff files at all, but contain the entire wikified text, but in compressed format. Each 'diff file' (compressed text file) contains the entire text of the current page. Every save creates a new file containing the entire page, but compressed.
My concern with 'diff files' at the CC-Wiki are that too many are generated when an editor, such as yourself, make many saves as you edit instead of just saving once when you are done editing. Although this can be dangerous in terms of losing work, it is the best way to work with the wiki software. As all of these so-called 'diff files' pile up, I run short on storage space. Eventually I need to spend several hours deleting old files, which are hard to find because they are scattered through many unrelated directories, and have unhelpful naming. I should write a tool for this activity, but have very little free time to do it.
I've written you with information similar to this message in private email. Have you forgotten?
Ron
Norbert Raimund Leisner wrote:Hello ,
if at least two (or more persons) are working (simultaneously) on an online project -like CCWiki- duplicate texts (with diff-files) are existing.
What is in your opinion an appropriate tool to delete such duplicate files automatically?
http://www.pcworld.com/article/2039794/ ... files.html
http://www.techrepublic.com/blog/linux- ... matically/
http://help.eclipse.org/luna/index.jsp? ... ks-68b.htm
https://sourceforge.net/projects/winmerge/
https://www.lifewire.com/compare-two-te ... ux-3861434
Best wishes,
Norbert
-
- Posts: 5228
- Joined: Thu Mar 09, 2006 9:40 am
- Full name: Vincent Lejeune
Re: CCWiki: diff-files
Under windows, I use the freeware "DoubleKiller" ( http://www.bigbangenterprises.de /en/doublekiller/ ), it sorts files by size and CRC-32.Norbert Raimund Leisner wrote:Hello ,
if at least two (or more persons) are working (simultaneously) on an online project -like CCWiki- duplicate texts (with diff-files) are existing.
What is in your opinion an appropriate tool to delete such duplicate files automatically?
http://www.pcworld.com/article/2039794/ ... files.html
http://www.techrepublic.com/blog/linux- ... matically/
http://help.eclipse.org/luna/index.jsp? ... ks-68b.htm
https://sourceforge.net/projects/winmerge/
https://www.lifewire.com/compare-two-te ... ux-3861434
Best wishes,
Norbert