Boot.dev Blog ยป Bitcoin ยป Base64 vs Base58 Encoding

Base64 vs Base58 Encoding

By Lane Wagner on Nov 3, 2020

Curated backend podcasts, videos and articles. All free.

Want to improve your backend development skills? Subscribe to get a copy of The Boot.dev Beat in your inbox each month. It's a newsletter packed with the best content for new backend devs.

Base64 is one of the most popular encoding formats for representing data. Have some binary data? Base64 encodes it for convenient readability and parsing. Base58 is just another encoding format (with 58 characters instead of 64, and has gained popularity largely due to Bitcoin and other cryptocurrencies. Also, if you came here confused, encryption and encoding are not the same! Take a look at this article for more information on encryption vs encoding.

When it comes to data encoding, there is typically a trade-off made between:

  • Human Readability: Do humans have a good idea of what is being represented at a glance?
  • Efficient data compression: How many bytes are used to represent the same data, and how many characters are available?

Let’s rank Base58, Base64, and ASCII encoding against each other using these metrics!

First: A Note on Binary ๐Ÿ”—

All data is stored in a raw binary format on computers. These encoding formats (Base64, Base58, and ASCII) are just different ways of reading and writing binary data. For example, in Base64 the binary code 000000 represents the letter A, but in ASCII the binary 00000000 represents the NUL character.

ASCII Encoding ๐Ÿ”—

ASCII encoding chart

Human Readability Rank: 1st ๐Ÿ”—

Not all human language characters are possible using ASCII, but the most important ones are. ASCII is meant to be used in applications that need a simple way to represent Latin-based text, Arabic numerals, and formatting characters like newlines and spacing. ASCII is based on how typewriters worked in the pre-computer days.

Data Compression Rank: 3rd ๐Ÿ”—

Each character (typically) takes an entire byte (8 bits) of data. To represent the NUL character, we need to store 00000000, which is a lot of wasted zeros! It is worth noting that ASCII can work with only 7 bits, but because computers work in base-2, it is more simple to use a full byte.

You could also call ASCII Base128 because its alphabet is made up of 128 characters.

Base64 ๐Ÿ”—

base64 chart

Human Readability Rank: 3rd ๐Ÿ”—

Base64 is designed to carry data stored in binary formats across channels that only reliably support text content

Wikipedia

Base64 was essentially designed to trick computers… kind of. When we have binary but are only allowed to transport or display text, Base64 is a great choice.

Data Compression Rank: 2nd ๐Ÿ”—

Each character only takes 6 bits of data, and a padding character, =, is used to round to the nearest multiple of 4.

Base58 ๐Ÿ”—

base58 chart

Human Readability Rank: 2nd ๐Ÿ”—

Satoshi Nakamoto, also the anonymous creator of Bitcoin, invented Base58. The goal was to get data compression levels comparable to that of Base64 but to make it easier for humans to read by eliminating characters that look similar to 0 (zero), O (capital o), I (capital i), l (lower case L). Alphanumeric characters + (plus) and / (slash) were also omitted for readability.

Data Compression Rank: 3rd ๐Ÿ”—

Very similar to that of Base64, but due to 6 fewer characters, not quite as efficient. Additionally, the parsing is slightly more awkward due to the base not being a power of 2.

Find a problem with this article?

Report an issue on GitHub