Ludwig - cs/software_development/data

To B or not to B: B-Trees with Optimistic Lock Coupling

Added on June 27, 2025

CedarDB is a database system that delivers unmatched performance for transactions and analytics, from small writes to handling billions of rows. Built on cutting-edge research to power today’s tools and tomorrow’s challenges.

cs/software_development/concurrency cs/software_development/data_structures/btrees

GitHub - avinassh/py-caskdb: (educational) build your own disk based KV store

Added on December 16, 2024

(educational) build your own disk based KV store. Contribute to avinassh/py-caskdb development by creating an account on GitHub.

cs/software_development/data_structures/key_value_stores cs/systems_programming

A Fat Pointer Library

Added on October 27, 2024

libCello Official Website

cs/theory/programming_languages/c cs/software_development/data_structures/pointers

B-trees and database indexes

Added on September 13, 2024

B-trees are used by many modern DBMSs. Learn how they work, how databases use them, and how your choice of primary key can affect index performance.

cs/software_development/data_structures/btrees

Zig's BoundedArray

Added on August 27, 2024

A quick look at how and why to use Zig's BoundedArray.

cs/theory/programming_languages/zig cs/software_development/data_structures/arrays

The Hunt for the Missing Data Type

Added on August 3, 2024

A (directed) graph is a set of nodes, connected by arrows (edges). The nodes and edges may contain data. Here are some graphs: All graphs made with graphviz (source) Graphs are ubiquitous in software engineering: Package dependencies form directed graphs, as do module imports. The internet is a graph of links between webpages. Model checkers analyze software by exploring the “state space” of all possible configurations.

cs/software_development/data_structures/graphs

Zip Files All The Way Down

Added on July 27, 2024

The text discusses creating self-reproducing programs and files like zip files that can decompress to themselves. It explores using Lempel-Ziv compression for self-reproduction and the challenges of translating these concepts into real opcode encodings like DEFLATE used in gzip and zip files. The ultimate goal is to create a zip file that contains a larger copy of itself recursively, creating a chain of expanding zip files.

cs/software_development/data_structures/serialization

Criticizing Hare language approach for generic data structures

Added on July 17, 2024

The blog criticizes the Hare language approach for not providing generic data structures like hash maps in its standard library. It highlights the complexity and importance of hash tables in various programming languages and emphasizes the need for efficient data structures in modern programming ecosystems. The author disagrees with Hare's approach and stresses the significance of hash tables in software development.

cs/software_development/data_structures/hash_maps

Pointers

Added on July 15, 2024

Pointers in Zig allow variables to reference memory addresses. Understanding pointers helps manipulate memory effectively. Pointers are values that store memory addresses and can be nested within structures.

cs/theory/programming_languages/zig cs/software_development/data_structures/pointers

Cap'n Proto, FlatBuffers, and SBE

Added on July 1, 2024

FlatBuffers is a new serialization protocol released by Google engineers, similar to Cap’n Proto. Cap’n Proto allows random access using pointers, while FlatBuffers uses offsets stored in tables for random access. Protobufs, Cap’n Proto, and FlatBuffers have custom schema languages and different features for data serialization and access.

cs/software_development/data_structures/serialization

How to use hash map contexts to save memory when doing a string table

Added on June 30, 2024

The text explains how to save memory when building a string table using hash map contexts. By adapting context APIs, only indexes are stored in the table, reducing memory usage. This method can save 117 KB of memory for a string table with 10 thousand entries.

cs/theory/programming_languages/zig cs/software_development/data_structures/hash_maps

Zig's HashMap - Part 1

Added on June 24, 2024

Zig's std.HashMap implementation relies on two key functions: hash and eql. The documentation outlines various hash map types and their functionalities, including std.HashMapUnmanaged. AutoHashMap can automatically generate hash functions, but there are limitations, and custom contexts can be provided for more complex keys.

cs/theory/programming_languages/zig cs/software_development/data_structures/hash_maps

Prospecting for Hash Functions

Added on June 24, 2024

The text discusses the process of designing non-cryptographic integer hash functions, exploring different operations and constraints to create effective hash functions. It also compares various 32-bit hash functions and their bias levels, highlighting the search for high-quality hash functions with minimal bias for both 32-bit and 64-bit integers.

cs/software_development/data_structures/hash_maps cs/theory/algorithms

Uiuisms

Added on June 9, 2024

This text provides a list of Uiua functions for solving common problems. Contributors can add more functions to the list in the repository. Functions include splitting arrays, removing rows, upscaling matrices, and working with diagonal arrays.

cs/software_development/tutorials cs/software_development/data_structures/arrays

Bookmarks