There's many MD5 collision examples out there on the web, but as far as I can find, they all involve binary data inputs.
Are there any two known plain-text ASCII strings that give the same MD5 hash?
Basically I'm building a de-duplication system that stores plain text files such as JSON + XML, but it skips over any files that are detected to contain binary data. And I need a way to test how the system copes with two plain text (non-binary) files/strings that give the same MD5 hash.
I share a sample that was posted to Twitter!
md5("TEXTCOLLBYfGiJUETHQ4hAcKSMd5zYpgqf1YRDhkmxHkhPWptrkoyz28wnI9V0aHeAuaKnak") = md5("TEXTCOLLBYfGiJUETHQ4hEcKSMd5zYpgqf1YRDhkmxHkhPWptrkoyz28wnI9V0aHeAuaKnak")
https://twitter.com/realhashbreaker/status/1770161965006008570