June 24, 2020

So this happened.

I decided to try out this lossless text-compression demonstration site by Fabrice Bellard. It uses GPT-2 natural language generation and prediction to achieve compression. As sample text, I used the first paragraph of Donald Trump’s recent rally speech in Tulsa, Oklahoma. (I figured if anything can compress well using predictive machine learning, surely Trump’s speech patterns can.)

Here’s the compressor site, with most of the input and all of the output showing:

compression page with both input and output displayed

The output looks like a short string mixing Chinese and Korean because the compressed text is represented as a series of Unicode characters (encoding 15 bits of information per character — which makes the compression ratio displayed, 804/49, a bit misleading, since the characters on the bottom are twice as large as the characters on the top: 402/49 would be more more accurate, and still quite impressive).

Anyway, I naturally thought “Hmm! What would happen if I were to paste this presumably random Chinese/Korean output into Google Translate?”

“I am a prisoner, and I am in a state of mind.”

Aren’t we all, Internet? Aren’t we all?

Latest posts

The Right to Lie: Google’s “Web Environment Integrity” Proposal is a Geyser of Badness Threatening to Swamp the Open Web.

July 29, 2023

If your computer can’t lie to other computers, then it’s not yours. This is a fundamental principle of free and

rants.org

So this happened.

Leave a Reply

Latest posts

The Right to Lie: Google’s “Web Environment Integrity” Proposal is a Geyser of Badness Threatening to Swamp the Open Web.

count-fold-lines: Emacs hack to fold duplicate lines and count them.

Twelve Pieces of Classical Music, for Jim

Why not to sign the anti-Stallman petition on GitHub.

Actual comment from a LaTeX document that I am writing now.

Don’t Cover For, Just Cover: How to Report on Trump

Why the Internet Archive’s National Emergency Library is a Good Idea.

Ethics Enforcement Via Software Licenses Considered Harmful.

SOLVED: ‘apt-get dist-upgrade’ error when going from Debian 9.x (“stretch”) to 10.0 (“buster”).

Archives