DuckDB for Python Developers
If you have ever tried to run a quick aggregation on a 3 GB CSV file in pandas, you know the ritual: wait for it to load into memory, watch your RAM climb, maybe get a Memory Er...
Recent items include:
just because we want to make 'em mad
Batching teensy changes in chunks creates massive performance boost, DuckDB Labs team claims
The team behind in-process OLAP database DuckDB has put forward a solution to the "smal...
why isn't anyone talking about this?
If you've ever loaded a 2 GB CSV into pandas just to run a few aggregations, and watched your machine struggle, there's a better tool for the...
Core Features: pg-cdc is not just replication. It streams the Postgres write-ahead log (WAL) out of production Postgres into typed, immutable, time-travelable Iceberg tables on S3...
Everything in this stack runs on your laptop. No API keys. No cloud bills. No data leaving the building. Here’s how to build it.
a lesson in fundamentals
Apache Doris 4.1 adds UPDATE, DELETE, and MERGE INTO on Iceberg tables directly from a SQL client, without a separate Spark job. Iceberg V3 Deletion Vectors and Row Lineage make this DML...
it's always something you know
DB GUI is a desktop app for querying databases, built in Ruby. It supports PostgreSQL to start. Version 0.4.0 added support for remembering SQL command history. I needed that feature at...
Moving data from your operational database has traditionally meant setting up and...
Apache Spark is one of the most powerful tools in the data and AI engineering world. It helps process massive datasets and is widely used across industries, irrespective of cloud p...
A couple of weeks ago I published the redb.Core intro post — what RedBase is at the API level, why I wrote it, what production looks like, the LINQ surface, what generated SQL look...
Databricks SQL logs key attributes of every query automatically: who ran it, on which...
Ever since my last job I have been wanting to make this. I think it's not the first time I've done it, but for one reason or another, I did it (again?) in just two evenings. In that job...