Saturday, November 30, 2024

New top story on Hacker News: Show HN: Jinbase – Multi-model transactional embedded database

Show HN: Jinbase – Multi-model transactional embedded database
4 by alexrustic | 0 comments on Hacker News.
Hi HN! Alex here. I'm excited to show you Jinbase ( https://ift.tt/fQNiKDu ), my multi-model transactional embedded database.

Almost a year ago, I introduced Paradict [1], my take on multi-format streaming serialization. Given its readability, the Paradict text format turned out to be an interesting format for config files. But using Paradict to manage config files would have cluttered its programming interface and confused users who already have dedicated config-file libraries (TOML, INI, etc.) to choose from. So I used Paradict as a dependency for KvF (Key-value file format) [2], a new project of mine focused on config files with sections.

With its compact binary format, I thought Paradict would be an efficient dependency for a new project that would rely on I/O functions (such as Open, Read, Write, Seek, Tell, and Close) to implement a minimalistic yet reliable persistence solution. But that was before I learned that "files are hard" [3]. SQLite, with its transactions, BLOB data type, and incremental I/O for BLOBs, seemed like the right giant to stand on for my new project.

Jinbase started small as a key-value store and ended up as a multi-model embedded database that pushes the boundaries of what we usually do with SQLite. The transition to the second data model (the depot) happened when I realized that the key-value store was not well suited to cases where a unique identifier should be generated automatically for each new record, saving the user from supplying an identifier that could accidentally collide with, and thus overwrite, an existing record. After that, I implemented a search capability that accepts UID ranges for the depot store, timespans (records are automatically timestamped) for both the depot and key-value stores, and GLOB patterns and number ranges for string and integer keys in the key-value store. The queue and stack data models emerged as solutions for use cases where records must be consumed in a specific order; a typical record is retrieved and deleted from the database in a single transaction.

Since SQLite is the storage engine, Jinbase supports the relational model de facto. For convenience, all tables related to Jinbase internals are prefixed with "jinbase_", which makes Jinbase a useful tool for opening legacy SQLite files and adding new data models that safely coexist with the ad hoc relational model.

All four main data models (key-value, depot, queue, stack) support Paradict-compatible data types, such as dictionaries, strings, binary data, integers, and datetimes. Under the hood, when the user initiates a write operation, Jinbase serializes (except for binary data), chunks, and stores the data iteratively. A record can be accessed not only in bulk but also with two levels of partial-access granularity: byte-level and field-level. While SQLite's incremental I/O for BLOBs targets an individual BLOB column in a row, Jinbase extends this so that, for each record, incremental reads cover all of its chunks as if they were a single unified BLOB. For dictionary records only, Jinbase automatically creates and maintains a lightweight index of pointers to its root fields, which makes it possible to extract a single field from an arbitrary record, deserialized before it is returned.
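To make the four data models concrete, here is a rough sketch of what working with them could look like. The method names below are illustrative assumptions, not the confirmed API; see the README for the real interface.

    # Illustrative sketch of Jinbase's four main data models.
    # Method names here are assumptions for readability; the actual
    # API may differ, so check the project README.
    from jinbase import Jinbase  # assumed entry point

    db = Jinbase("app.db")  # a regular SQLite file under the hood

    # Key-value store: the caller supplies the key.
    db.kv.set("prefs", {"dark_mode": True, "font_size": 14})
    prefs = db.kv.get("prefs")

    # Depot: a UID is generated for each new record, so a caller
    # can never accidentally overwrite an existing entry.
    uid = db.depot.append({"event": "login", "user": "alice"})
    record = db.depot.get(uid)

    # Queue: FIFO consumption; retrieval and deletion happen in a
    # single transaction.
    db.queue.enqueue(b"job payload")
    job = db.queue.dequeue()

    # Stack: LIFO consumption, same transactional retrieve-and-delete.
    db.stack.push("undo step")
    last = db.stack.pop()

    db.close()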
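If you are curious about the SQLite feature this builds on, here is a minimal standalone sketch of incremental BLOB I/O using Python's stdlib sqlite3 (Connection.blobopen requires Python 3.11+). It shows only the raw SQLite capability of reading and writing a slice of a BLOB without materializing the whole value, not the chunked layer Jinbase adds on top.

    import sqlite3

    con = sqlite3.connect("demo.db")
    con.execute("CREATE TABLE IF NOT EXISTS chunks (id INTEGER PRIMARY KEY, data BLOB)")
    # zeroblob(1024) reserves a 1 KiB BLOB that we can fill incrementally.
    con.execute("INSERT OR REPLACE INTO chunks (id, data) VALUES (1, zeroblob(1024))")
    con.commit()

    # Open a handle to the BLOB in row 1 and touch only the bytes we
    # need, without loading the whole value into memory.
    with con.blobopen("chunks", "data", 1) as blob:
        blob.write(b"header-bytes")  # writes at offset 0
        blob.seek(0)
        print(blob.read(6))          # b'header'

    con.close()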
The most obvious use cases for Jinbase are storing user preferences, persisting session data before exit, order-based processing of data streams, exposing data to other processes, upgrading legacy SQLite files with new data models, and bespoke data persistence solutions. Jinbase is written in Python and available on PyPI, and you can play with the examples in the README. Let me know what you think about this project. [1] https://ift.tt/nCZvDqX [2] https://ift.tt/OMHbz7F [3] https://ift.tt/Q3cwUCG

New top story on Hacker News: If not React, then what?

If not React, then what?
84 by pier25 | 153 comments on Hacker News.


New top story on Hacker News: You must read at least one book to ride

You must read at least one book to ride
13 by Kinrany | 1 comment on Hacker News.


Wednesday, November 6, 2024

New top story on Hacker News: Launch HN: Midship (YC S24) – Turn PDFs and Images into usable data

Launch HN: Midship (YC S24) – Turn PDFs and Images into usable data
12 by maxmaio | 12 comments on Hacker News.
Hey HN, we are Max, Kieran, and Aahel from Midship ( https://midship.ai ). Midship makes it easy to extract data from unstructured documents like PDFs and images. Here's a video showing it in action: https://ift.tt/O8dBo2N?... , and a demo playground (no signup required!) to test it out: https://ift.tt/Gpsjf8O

We started 5 months ago, initially trying to build an AI natural-language workflow builder that would be a simpler alternative to Zapier or Make.com. However, most of our users were far more interested in the basic (and not very good) document extraction feature we had. Seeing people spend hours a day manually extracting data from PDFs inspired us to build what has become Midship!

The problem is that despite all our progress in software, huge amounts of business data still live in PDFs and images. Sure, you can OCR them, but getting clean, structured data out is still painful. Most existing tools just give you a blob of markdown, leaving you to figure out which parts matter and how they relate. We've found that combining OCR with language models lets us do something more useful: extract the specific fields and tables that users actually care about. The LLMs help correct OCR mistakes and understand context (like knowing that "Inv#" and "Invoice Number" mean the same thing).

We have two main kinds of users today: non-technical users who extract data via our web app, and developers who use our extraction API. We initially focused on the first group, as they seemed like an underserved part of the market, but we've received a lot of interest from developers who face the same issues.

For pricing, we currently charge a monthly SaaS fee per seat for the web app and volume-based pricing for the API. We're really excited to share what we've built so far and look forward to any feedback from the community!
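To give a flavor of the OCR-plus-LLM technique (this is a simplified sketch, not our actual pipeline: the schema and prompt are illustrative, and call_llm is a hypothetical stand-in for any chat-completion client):

    import json

    # Sketch of the OCR + LLM extraction pattern. The schema and
    # prompt are illustrative; call_llm is a hypothetical stand-in
    # for any chat-completion API, not Midship's internals.

    SCHEMA = {
        "invoice_number": "string",
        "invoice_date": "YYYY-MM-DD",
        "total_amount": "number",
    }

    def extract_fields(ocr_text, call_llm):
        prompt = (
            "Extract these fields from the OCR'd document below as JSON.\n"
            f"Schema: {json.dumps(SCHEMA)}\n"
            "Treat synonymous labels as the same field (e.g. 'Inv#' and\n"
            "'Invoice Number') and correct obvious OCR mistakes.\n"
            "Use null for fields that are absent.\n\n"
            f"Document:\n{ocr_text}"
        )
        return json.loads(call_llm(prompt))

The point of the schema-in-prompt approach is that the model returns exactly the fields you asked for, rather than a blob of markdown you still have to parse.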