Yeah it started commercially about a decade ago from what I can tell. Now it’s probably part of every ebook. It’s just a little code. Cheap and easy if you want to track leaks and/or pirating.
Old spy craft thing. Disseminate 12 copies of a doc, each with a very subtle difference. If that doc pops up somewhere it shouldn’t, you know where it came from.
It’s definitely still useful and easier to do now too. SpaceX and Tesla both allegedly use it to catch leakers. It’s usually done now with whitespace and/or invisible characters.
What worries me is, what if they put in some obfudcated unique ID in the document that can be traced back to you? Is there a way to catch it with reasonable certainty?
I doubt OP is a publisher. Nobody goes after the little guy (except when they want to make an example of someone, but the odds of that happening to you are slim to none).
I regularly get materials with identifying information embedded in them, it’s very common in the technical publishing industry, not unreasonable to check a purchased eBook for an identifier which ties it back to the original purchaser.
Think there was a github repo with a script which reads out the visible information from an epub and puts it into a new one. Basically copying it, but leaving everything extra behind. Sadly could not find it at the moment.
Remove watermarks, metadata, etc first. Whatever could be used to identify you.
Also run spell check. Specific misspellings are another tracking method.
Reminds me of “paper street”
Is that why they’ve been more prevalent? Just thought terrible editor not intentionally making books worse.
why don’t people want to pay full price for ebooks
It’s a spy/security thing, never actually heard of it being used for something like ebooks.
Yeah it started commercially about a decade ago from what I can tell. Now it’s probably part of every ebook. It’s just a little code. Cheap and easy if you want to track leaks and/or pirating.
Old spy craft thing. Disseminate 12 copies of a doc, each with a very subtle difference. If that doc pops up somewhere it shouldn’t, you know where it came from.
LOL, probably not so useful in the digital age.
It’s definitely still useful and easier to do now too. SpaceX and Tesla both allegedly use it to catch leakers. It’s usually done now with whitespace and/or invisible characters.
What worries me is, what if they put in some obfudcated unique ID in the document that can be traced back to you? Is there a way to catch it with reasonable certainty?
you can view and edit every bit of metadata inside Calibre. If you want to make sure, go and edit the metadata there.
Yeah but what if the fingerprint is more tricky? Like a blank line with the font called afzer the fingerprint? Metadata inside of an image?
To be reasonable safe one would have to buy the book from multiple accounts and compare some hashes.
For fucks sake I’m pretty privacy conscious but you guys just take it too far
I’m pretty sure there was at least one publisher caught experimenting with this.
I doubt OP is a publisher. Nobody goes after the little guy (except when they want to make an example of someone, but the odds of that happening to you are slim to none).
I regularly get materials with identifying information embedded in them, it’s very common in the technical publishing industry, not unreasonable to check a purchased eBook for an identifier which ties it back to the original purchaser.
I have no purchased ebooks, but am genuinely curious; how trivial is it to remove watermarks and metadata?
Think there was a github repo with a script which reads out the visible information from an epub and puts it into a new one. Basically copying it, but leaving everything extra behind. Sadly could not find it at the moment.