Prompt: Write a Python script that I can run on my computer to identify all files that are of file type: picture.
Using the Python script, make a list of these picture files and index them by directory name, file name, file size, date of creation and file extension. Once this list is made, the script checks which files are duplicates of another file by comparing the date of creation, file size, checksum or another more accurate method. Check each file for a duplicate.
The expected output of the script is a list of the picture files on my computer that are duplicates based on creation date, file size and checksum. For each file identified, give the directory path, file name, size and creation date, and next to each file give the duplicate's file name, directory path, size and creation date. Each row must contain one file that has a duplicate and include all of its duplicates in the same row. Once a file is part of one row, do not repeat it in another row; exclude files already identified and listed. Provide a final duplicate file list that contains rows of a file and its duplicate files.
At the start of the script, ask the user for the directory where the search is to begin; the script must search all subdirectories. Do not dump output on the screen; place the output in a CSV file instead. Duplicates are searched for in all directories, not only where the original file is found. At the end, state how many files were analyzed, how many duplicate files were found and how many directories were searched.
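The duplicate check described above could be sketched roughly as follows. This is only an illustration under stated assumptions, not what any of the tested models produced: the extension set `PICTURE_EXTENSIONS` and the function names are hypothetical, and SHA-256 is one possible choice of checksum. Files are first bucketed by size, and only same-size files are hashed, so each file ends up in at most one duplicate group.

```python
import hashlib
import os
from collections import defaultdict

# Assumed set of picture extensions; the prompt does not list any.
PICTURE_EXTENSIONS = {".jpg", ".jpeg", ".png", ".gif", ".bmp", ".tiff", ".webp"}


def sha256_of(path, chunk_size=1 << 20):
    """Hash a file in chunks so large pictures are not read into memory at once."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def find_picture_files(root):
    """Walk root and all subdirectories; return (picture paths, directories visited)."""
    pictures, directories = [], 0
    for dirpath, _dirnames, filenames in os.walk(root):
        directories += 1
        for name in filenames:
            if os.path.splitext(name)[1].lower() in PICTURE_EXTENSIONS:
                pictures.append(os.path.join(dirpath, name))
    return pictures, directories


def group_duplicates(paths):
    """Return lists of paths with identical content; a path is in at most one list."""
    by_size = defaultdict(list)
    for path in paths:
        by_size[os.path.getsize(path)].append(path)

    groups = []
    for same_size in by_size.values():
        if len(same_size) < 2:
            continue  # a file with a unique size cannot have a duplicate
        by_hash = defaultdict(list)
        for path in same_size:
            by_hash[sha256_of(path)].append(path)
        groups.extend(sorted(g) for g in by_hash.values() if len(g) > 1)
    return groups
```

Creation date is deliberately left out of the comparison in this sketch, because a copied file usually receives a new creation timestamp even though its content is identical.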
The final outputs would be:
Unique File list:
In the next step, make a list of files that are of file type: picture and include only those files that are unique and DO NOT have any copies; these will not be part of the list made earlier, called the Duplicate File list. In the list, include the directory path, file name, file size, creation date, file type and checksum. The output of this list is a CSV file. Also mention how many files are present in total.
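As a rough sketch of how one row of such a list might be built and written to CSV: the column names in `FIELDS` and the helpers `describe` and `write_file_list` are hypothetical, and the checksum is assumed to have been computed elsewhere. Note that "creation date" is platform dependent: `st_birthtime` exists only on some platforms, and on Linux the `st_ctime` fallback is the last metadata change rather than the creation time.

```python
import csv
import os
from datetime import datetime

# Assumed column names; the prompt only names the fields informally.
FIELDS = ["directory", "file_name", "size_bytes", "created", "extension", "checksum"]


def creation_time(stat_result):
    """Best-effort creation time: st_birthtime where available, else st_ctime."""
    timestamp = getattr(stat_result, "st_birthtime", stat_result.st_ctime)
    return datetime.fromtimestamp(timestamp).isoformat(timespec="seconds")


def describe(path, checksum):
    """Build one CSV row for a file whose checksum was computed elsewhere."""
    info = os.stat(path)
    directory, file_name = os.path.split(os.path.abspath(path))
    return {
        "directory": directory,
        "file_name": file_name,
        "size_bytes": info.st_size,
        "created": creation_time(info),
        "extension": os.path.splitext(file_name)[1].lower(),
        "checksum": checksum,
    }


def write_file_list(rows, csv_path):
    """Write a list of row dicts to csv_path and return how many were written."""
    with open(csv_path, "w", newline="", encoding="utf-8") as handle:
        writer = csv.DictWriter(handle, fieldnames=FIELDS)
        writer.writeheader()
        writer.writerows(rows)
    return len(rows)
```

The return value of `write_file_list` gives the total file count that the prompt asks to be reported.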
Complete File list:
Combine the Duplicate File list and the Unique File list and provide the output in CSV format. The complete file list must contain the unique files from the Unique File list plus one file (with its file name, directory, file size and checksum) from each row of the Duplicate File list. Do not add the duplicate files; only one file per row is to be added. Provide the Complete File list.
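The relationship between the three lists can be illustrated with a small sketch. It assumes a mapping from file path to checksum has already been computed; the function name `split_by_checksum` is hypothetical. Keeping the first path (in sorted order) of each duplicate group as its representative is an arbitrary choice made here for illustration.

```python
from collections import defaultdict


def split_by_checksum(checksums):
    """checksums maps path -> checksum.

    Returns (duplicate_groups, unique_files, complete_list), where the
    complete list holds every unique file plus one file per duplicate group.
    """
    by_checksum = defaultdict(list)
    for path, checksum in sorted(checksums.items()):
        by_checksum[checksum].append(path)

    duplicate_groups = [paths for paths in by_checksum.values() if len(paths) > 1]
    unique_files = [paths[0] for paths in by_checksum.values() if len(paths) == 1]
    # One representative per duplicate group, never the extra copies.
    complete_list = sorted(unique_files + [group[0] for group in duplicate_groups])
    return duplicate_groups, unique_files, complete_list
```

A simple consistency check follows from this: the complete list should have exactly as many entries as there are unique files plus duplicate rows.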
User review:
Ask the user to review the Complete File List, then ask for an upload of an input FILE LIST TO COPY: a subset of the Complete File List that the user has reviewed and from which the user has deleted entries that are not needed.
Ask the user to provide a directory path where files need to be copied called DESTINATION DIRECTORY.
Copy unique file to destination:
Using the directory path provided by the user, copy all files in the FILE LIST TO COPY list that includes file name and the directory path where each file is present, to the destination directory that the user has provided.
In the final summary, provide the count of files copied and the time it took from start to finish.
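The copy step might look something like the sketch below. It assumes the FILE LIST TO COPY is a CSV with `directory` and `file_name` columns (hypothetical names), and it times only the copy step rather than the whole script. Because files from different directories can share a name, the sketch appends a numeric suffix instead of overwriting; the prompt does not say how such collisions should be handled.

```python
import csv
import os
import shutil
import time


def copy_listed_files(list_csv, destination):
    """Copy every file named in list_csv into destination; return (copied, seconds)."""
    started = time.perf_counter()
    os.makedirs(destination, exist_ok=True)
    copied = 0
    with open(list_csv, newline="", encoding="utf-8") as handle:
        for row in csv.DictReader(handle):
            source = os.path.join(row["directory"], row["file_name"])
            target = os.path.join(destination, row["file_name"])
            stem, extension = os.path.splitext(row["file_name"])
            counter = 1
            while os.path.exists(target):  # avoid overwriting same-named files
                target = os.path.join(destination, f"{stem}_{counter}{extension}")
                counter += 1
            shutil.copy2(source, target)  # copy2 also tries to preserve timestamps
            copied += 1
    return copied, time.perf_counter() - started
```

The returned pair maps directly onto the two numbers requested for the final summary.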
RESULT: Gemini, Deepseek, Manus, Ernie, Qwen and ChatGPT all handled the task well.
Somewhat behind are Copilot, Perplexity, Grok, Claude, Kimi and Mistral.
Minimax failed to remove duplicate files.