Citum: a reimagining of CSL

Bruce_D_Arcus1 · January 29, 2026, 2:30pm

UPDATE

I’ve decided to distance this project from CSL, so the new name (and org) is Citum. DNS is currently propagating, but this will be the new apex URL:

I believe I’ve solved, or at least have a plan to resolve, all of the issues and pain points we identified in this virtual meeting we had in 2022.

The project is almost feature complete, which is to say it supports the features in CSL 1.0 plus many more: multilingual, sectional bibs, advanced EDTF dates, etc.

Earlier Context

A few years ago, I started experimenting with a new approach to evolving CSL in the bdarcus/csln repo on GitHub ("Reimagining CSL").

But I’m busy, and am an amateur programmer with aspirations that (far) exceed my time or skills.

In the past week, however, I’ve been doing a deep dive into new agentic coding tools, notably the latest Claude Opus and Google Gemini models.

After I got more comfortable with how to exploit these tools, I threw this new project together in less than 24 hours, and I have to say: I am super impressed. It already does much more than what I achieved with the earlier project (though it borrows code from it).

I should add, however, that a huge question for me remains whether the 100% fidelity claim mentioned in the README is even possible. I aim to figure this out over the coming weeks (though early progress is slow, so it may take many weeks!).

Basically, I had these tools analyze how to extend my earlier experiments (which got pretty far actually) in order to bring the vision to completion.

Perhaps the most interesting possibility this opens up, I think, is reflected in the contributing section of the README, which you can see if you create a new issue and select “domain expert.”

The prior art analysis is also super interesting. It’s the result of me asking how to synthesize all the information in the respective code bases, the csln repo issue tracker, and the spec documents for CSL/M 1.0. That’s now incorporated into the roadmap (this file, while human readable, is aimed at the LLM tools).

Here’s a parallel project with the start of a Rust-based server. I intend to build a client UI on top of it, based on an idea I’ve previously talked about, and I will make sure the core code supports it (for live previewing and such). Here’s the browsing UI I imagine:

And this is a representation of the creation wizard I’ve previously discussed, with the idea being it has live previewing.

Bruce_D_Arcus1 · January 31, 2026, 12:36pm

I’ve added a couple of design docs to address some long-standing issues and questions:

How to deal with style reuse and duplication; here there will be no dependent styles, but a more composable alternative. Notably also, iterative development is now driven by the priorities and knowledge reflected in actually-existing 1.0 styles.
How to make finding and creating styles easier.

Bruce_D_Arcus1 · February 2, 2026, 10:32pm

I got a basic demo of the Rust server + SvelteKit client front-end working. The previews are (mostly) generated dynamically on the server.

As I said above, I’m trying to develop these in parallel so that they are fully complementary, though I will now turn back to the core code.

Bruce_D_Arcus1 · February 8, 2026, 7:40am

The last few weeks I’ve focused on improving the XML-based migration of styles to these new, quite different models. When I saw multiple high-end models fail, I decided to pull the plug on the approach, which was wasting a lot of time and resources.

Instead, I had a strong hunch that an earlier idea of mine, inferring templates from formatted citeproc-js output, would be simpler and more reliable.

So, I had the new Opus 4.6 model run an analysis of the two approaches, and have an architect agent propose a plan.

That plan is here; it uses the XML for what it’s good at, and a new JS inferrer script for the templates.

It includes initial experiments that justify the change in approach:

The inferrer validates that the hard problem (template structure) is better solved by observing output than by parsing XML. The XML compiler’s 0% bibliography match was not a bug — it was evidence that procedural-to-declarative translation via macro flattening is fundamentally harder than reverse-engineering from rendered output.
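The idea of treating rendered output as the source of truth can be illustrated with a minimal sketch. This is not the project's inferrer (which is a JS script working on citeproc-js output); the function name `infer_template` and the component tuples are hypothetical, and the sketch assumes every field value appears verbatim in the rendered string.

```python
def infer_template(fields, rendered):
    """Recover an ordered template from one rendered string.

    Returns a list of ("field", name) and ("literal", text) components.
    Hypothetical sketch: assumes field values appear verbatim in the output.
    """
    # Locate each field value in the rendered output; skip values not found.
    hits = []
    for name, value in fields.items():
        pos = rendered.find(value)
        if pos != -1:
            hits.append((pos, name, value))
    hits.sort()

    template = []
    cursor = 0
    for pos, name, value in hits:
        if pos < cursor:
            continue  # overlaps a component already emitted; ignore it
        if pos > cursor:
            # Text between two fields becomes a literal (prefix/delimiter).
            template.append(("literal", rendered[cursor:pos]))
        template.append(("field", name))
        cursor = pos + len(value)
    if cursor < len(rendered):
        template.append(("literal", rendered[cursor:]))
    return template


# Input data and rendered citation for one reference.
template = infer_template(
    {"author": "Berger & Luckmann", "year": "1966"},
    "(Berger & Luckmann, 1966)",
)
```

Here `template` comes out as an opening parenthesis, the author field, a `", "` delimiter, the year field, and a closing parenthesis. A real inferrer would need many references per style to separate fixed delimiters from type-specific variation.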

The next step is to hook this script up to other scripts in order to test the full rendering impact.

Bruce_D_Arcus1 · February 8, 2026, 6:03pm

Latest updates:

Per above, I ditched the approach of trying to parse 1.0 XML macros and templates and map them to the very different new model; instead the focus will be deriving styles from the output (using citeproc-js). It’s much easier to reason about and debug parsing common input data than it is the insanely complex 1.0 styles.
I also found that, extending this idea, an LLM can not only do a good job of creating a style, but can also iteratively improve the code to match the expected output. This is reflected in a new styleauthor agent and skill. More.

Bruce_D_Arcus1 · February 10, 2026, 3:54pm

And this wrinkle from APA, not supported in CSL 1.0, now works (along with integral/narrative citations generally)!

=== apa-7th.yaml ===

CITATIONS (Non-Integral):
  [pew_social_media] (Auxier & Anderson, 2021)
  [berger_luckmann] (Berger & Luckmann, 1966)
  [vaswani_attention] (Vaswani et al., 2017)
  [aad_atlas_higgs] (Aad et al., 2012)

CITATIONS (Integral):
  [pew_social_media] Auxier and Anderson (2021)
  [berger_luckmann] Berger and Luckmann (1966)
  [vaswani_attention] Vaswani et al. (2017)
  [aad_atlas_higgs] Aad et al. (2012)
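The difference between the two modes in the output above can be sketched as follows. This is a simplified illustration, not Citum's implementation: the function names are hypothetical, and the "et al." threshold of three or more names follows APA 7th in-text rules.

```python
def format_names(family_names, conjunction):
    """Join family names APA-style; three or more names collapse to 'et al.'."""
    if len(family_names) >= 3:
        return f"{family_names[0]} et al."
    if len(family_names) == 2:
        return f"{family_names[0]} {conjunction} {family_names[1]}"
    return family_names[0]


def cite(family_names, year, mode):
    """Render one citation in 'integral' or 'non-integral' mode (hypothetical API)."""
    if mode == "integral":
        # Names sit in the running text, so the conjunction is spelled out.
        return f"{format_names(family_names, 'and')} ({year})"
    # Parenthetical citation: names and year share the parentheses, with "&".
    return f"({format_names(family_names, '&')}, {year})"


integral = cite(["Auxier", "Anderson"], 2021, "integral")
non_integral = cite(["Auxier", "Anderson"], 2021, "non-integral")
```

With the first reference above, `integral` is `Auxier and Anderson (2021)` and `non_integral` is `(Auxier & Anderson, 2021)`.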

Allen · March 11, 2026, 2:08pm

yeah this makes sense. forcing CSL 1.0 XML macros into a totally different model was always going to be brittle. macro flattening + conditionals = semantic mess. that 0% bibliography match wasn’t a bug, it was proof the translation layer was wrong.

inferring templates from citeproc-js output is just smarter. you’re treating rendered output as the contract and reverse-engineering structure from there — that’s a problem LLMs can actually handle.

the fact that APA integral/narrative citations now work is the real win. if CSL 1.0 couldn’t express it cleanly but your new pipeline can, you’re clearly on the right track.

Bruce_D_Arcus1 · March 11, 2026, 2:27pm

It turns out they (particularly the latest codex models) are really good at this. I added a skill that just has them iteratively refine the style against the target, while simultaneously looking for code improvement opportunities. So a good upgrade “wave” sees big jumps in the style metrics AND useful code improvements.

I’ve since broadened that “authority” system beyond CSL; so there are now a few styles I had it port from biblatex, since they can’t be represented fully in CSL; in those cases, it uses the biblatex output as the source of truth.

Bruce_D_Arcus1 · March 11, 2026, 4:39pm

Also, by way of update, the citum-core project is now pretty much feature complete, with everything listed here now implemented.

Notably, a lot of these features don’t exist in CSL.

It also passes all strict clippy linting, and includes over 600 automated tests.

There are just some little things I need to do before calling this 1.0, and publishing libraries and relevant binaries in the right places.

What I need now: human testing above all.

Also in the realm of cool news, @zepinglee is looking into integrating this into LaTeX.

And I have offered to hand-off that proof-of-concept project to him to develop.

Bruce_D_Arcus1 · March 12, 2026, 2:44pm

More on the infrastructure end of the project, the core repo now has ~700 automated tests. But they’re not super transparent to humans.

So I’ve integrated a solution for that into the GitHub CI:

It’s not complete, but it documents the behavioral logic for two core crates: the engine (processor) and the migration crate, which translates CSL styles into Citum styles.

A number of core tests, BTW, were ported from the CSL and CSL-M test suites.

If anyone catches some we missed, let me know.

Bruce_D_Arcus1 · March 18, 2026, 3:54pm

I stumbled into implementing a few related features that in retrospect make perfect sense, but which I wasn’t thinking much about, since I am a monolingual English-language scholar and was never really involved in CSL style development and maintenance.

Together they should make the explosion of styles in CSL (language and other small variants of big styles like APA or Chicago; and the problems with dependent styles in general) obsolete.

The three primary changes:

First, what I call “presets” in Citum aren’t just aliases, as dependent styles in CSL are. They define default behavior, which can be locally overridden. So you want Chicago author-date but a different et al. rule? You can represent that very concisely. And while I am currently adding support for it at the style level, it’s pervasive throughout the design, so that feature ends up being, IMO, a superior solution to both dependent styles and macros.
Second, the biggest styles, with the most dependent styles in the CSL world, are compiled into the engine. So users don’t have to worry about finding, keeping track of, updating these styles.
Finally, the locale system can also be locally overridden. So language-variant styles (like chicago-author-date-de.csl), with variant locale files, should also no longer be needed.

The PR, now merged.

github.com/citum/citum-core

feat(schema): style preset architecture (csl26-fsjy) (#402)

main ← feat/csl26-fsjy-style-preset-loops

opened 05:30PM - 18 Mar 26 UTC

bdarcus

+4189 -803

## Summary

This PR adds two new composition features for Citum styles:

- **style presets** for whole-style reuse and thin wrappers
- **locale overrides** for locale-distinguished variants without cloning an entire style

It also makes the engine the canonical runtime resolution boundary for preset-backed styles, restores fidelity/tooling parity, and documents the remaining policy follow-up work.

## What This PR Adds

- preset-backed styles can inherit from a compiled-in base style and override only the delta
- nested style overrides now merge structurally instead of replacing whole blocks
- the engine resolves preset-backed styles during rendering, so CLI/tests/tooling all observe the same effective style
- locale overrides can carry style-specific locale adjustments such as the new `de-DE-chicago` example
- `citum-analyze` now quantifies the corpus-level migration savings unlocked by presets and locale overrides

## Why It Matters

These features reduce one-off conversion work in two different ways:

- **style presets** let us model style families and behavioral wrappers without copying full styles
- **locale overrides** let us represent locale-distinguished variants without maintaining a separate full style file

That gives us a more declarative migration path away from CSL's mix of dependent aliases, template-linked wrappers, and localized near-duplicates.

## Corpus Impact

Using the new analyzer report on the current `styles-legacy/` snapshot:

```bash
cargo run -p citum-analyze --bin citum-analyze -- \
  styles-legacy --quantify-savings --json
```

| Metric | Count | Notes |
| --- | ---: | --- |
| Independent CSL styles | 2,844 | top-level standalone styles |
| Dependent CSL styles | 7,987 | clear alias/wrapper styles with an independent parent |
| Unique dependent parents | 298 | distinct parent families referenced by dependents |
| Dependent alias savings | 7,987 | hard lower-bound conversions avoided by composition |
| High-confidence locale override savings | 11 | independent locale variants with a matching base slug, e.g. `chicago-author-date-de` |
| Possible locale override savings | 2,515 | independent styles with `default-locale`; informative, but not all are pure locale-only variants |
| Preset wrapper opportunity | 2,140 | template-linked independent styles, excluding the 11 high-confidence locale variants |
| Lower-bound avoided conversions | 7,998 | dependent aliases + high-confidence locale variants |
| Upper-bound avoided conversions | 10,138 | lower bound + broader preset-wrapper opportunity |

Top parent families by combined dependent + wrapper opportunity in this snapshot:

- `apa`: 906
- `elsevier-harvard`: 715
- `elsevier-with-titles`: 686
- `elsevier-vancouver`: 590
- `springer-vancouver-brackets`: 475

These numbers are intentionally presented as a **range**. The lower bound counts only the cases we can point to directly as aliases or clearly paired locale variants. The upper bound includes a broader heuristic opportunity bucket for template-linked wrapper styles.

## Implementation Highlights

- style preset resolution now lives in the engine
- preset merge order is `base -> variant -> local override`
- object fields merge recursively; arrays, scalars, and explicit `null` replace inherited values
- Chicago Notes and Taylor & Francis preset-backed styles now round-trip through fidelity checks correctly
- JS verification/report tooling now mirrors Rust preset resolution semantics

## Follow-Up Work

This PR intentionally leaves the policy questions in `csl26-wp6y` open:

- authored identity vs resolved behavior in reports/tooling
- comparator / benchmark authority for preset-backed wrappers
- when a thin wrapper should be treated as effectively its own style
- guardrails for large local override layers

## Verification

- `cargo fmt --check`
- `cargo clippy --all-targets --all-features -- -D warnings`
- `cargo test`
- fidelity checks restored for preset-backed styles

Note: `cargo nextest run` hung in test discovery in this environment, so I used the full `cargo test` fallback after confirming the analyzer/tests and workspace gate otherwise passed cleanly.
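The merge rules stated in the PR description (order `base -> variant -> local override`; objects merge recursively; arrays, scalars, and explicit `null` replace) can be sketched in a few lines. This is an illustration only, not the engine's Rust code; the style keys used here (`contributors`, `et-al`, `sort`) are hypothetical, and `None` stands in for an explicit `null`.

```python
def merge(base, override):
    """Merge one override layer onto a base value.

    Dicts (objects) merge key by key, recursively. Anything else
    (lists, scalars, explicit None) replaces the inherited value outright.
    """
    if isinstance(base, dict) and isinstance(override, dict):
        merged = dict(base)  # copy this level so the base is not mutated
        for key, value in override.items():
            merged[key] = merge(base[key], value) if key in base else value
        return merged
    return override


def resolve(base, variant, local):
    """Apply layers in the order base -> variant -> local override."""
    return merge(merge(base, variant), local)


# Hypothetical style fragments, for illustration only.
base = {
    "contributors": {"and": "symbol", "et-al": {"min": 4, "use-first": 1}},
    "sort": ["author", "year"],
}
variant = {"contributors": {"et-al": {"min": 3}}}         # changes one nested field
local = {"contributors": {"and": None}, "sort": ["year"]}  # null and array replace

effective = resolve(base, variant, local)
```

In `effective`, `et-al` keeps the inherited `use-first` while `min` becomes 3, `and` is reset to `None`, and `sort` is replaced wholesale rather than concatenated.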

I was more cautious with this work, because it’s a big change with far-reaching implications (not easy to revert, for example).

One thing I had the agent do to help with decision-making is to update the citum-analyze binary to include metrics that might allow a reasonable approximation of potential benefits compared to a CSL approach to this. A review agent then called those results conservative. So on balance I think the benefits largely outweigh the small to moderate increase in complexity.

I think the only way to really test, though, is to run a bulk process with an enhanced citum-migrate where the code + agent figures out how to optimize. We’ve made a start at that, but not really dug in.

It’s worth noting that I’m implementing some of the features to enable the new style wizard I’m working on. So I iterate back and forth between the two projects to co-evolve them as well.

That hub API UX is hard, but I should be able to demo it in the coming weeks. Here’s a screenshot of a working UI (which, BTW, points to a cool feature that the entire UI is built on: live WASM-based previewing that is super fast):

Bruce_D_Arcus1 · March 21, 2026, 10:56pm

Making this very early alpha available in case people want to try it out.

https://hub.citum.org

Notable:

“Dependent styles” are just entries in a registry, which are stored in the db.
There is WASM-based live previewing everywhere, including in …
… the wizard, which actually works, and I think shows what the schema design makes possible.

The big caveat is I’m still not sure if this iteration of the wizard is good enough for complex real world styles. As I said above, this UX is really hard, and I’ve just been focused on getting it to work, rather than rigorously testing it myself.

If this or an iteration of it does end up working well, however, the idea of the hub is that it would be the easier-to-use, easier-to-maintain successor to the CSL solutions (the styles repo, the editor, etc.).

So you can imagine extending this to include dedicated maintainer roles scoped by styles or categories, style versioning, etc; users will be able to “fork” and share styles, bookmark them, etc, which can also sync with their local citum installation.

Bruce_D_Arcus1 · April 21, 2026, 9:42pm

Another new feature is style “profiles.” I’ve kind of gone back-and-forth on the question of style inheritance, wanting to keep things as simple as possible, but add value where I can. This reflects an iteration of that.

So the processor (engine) embeds the ~20 “base” (Chicago, APA, IEEE, etc.) and profile styles; “dependent styles” are just entries in a registry, which is also embedded.
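As a rough sketch of what "just entries in a registry" could mean in practice: a dependent style is a name that points at an embedded style, with no style file of its own. Everything below is hypothetical (the names `EMBEDDED_STYLES`, `REGISTRY`, `resolve_style`, the `journal-of-examples` entry, and the placeholder style contents); only the `apa-7th` identifier comes from this thread.

```python
# Styles compiled into the engine (placeholder contents, hypothetical).
EMBEDDED_STYLES = {
    "apa-7th": {"title": "APA 7th"},
    "chicago-author-date": {"title": "Chicago author-date"},
}

# Registry of "dependent styles": each entry only names its target style.
REGISTRY = {
    "journal-of-examples": "apa-7th",  # hypothetical journal alias
}


def resolve_style(style_id):
    """Return the embedded style for an id, following a registry entry if any."""
    target = REGISTRY.get(style_id, style_id)
    if target not in EMBEDDED_STYLES:
        raise KeyError(f"unknown style: {style_id}")
    return EMBEDDED_STYLES[target]


style = resolve_style("journal-of-examples")
```

Under this assumption, adding a journal that follows APA means adding one registry entry rather than maintaining a separate style file.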

Bruce_D_Arcus1 · April 29, 2026, 4:49pm

At this point I’m running out of open issues that might warrant a schema change. There are some complex features, however, that I don’t have much personal experience with, most notably around multilingual functionality. Hoping some people with expertise in this area can vet the current schema and processor details, and test it all out.

Bruce_D_Arcus1 · May 19, 2026, 12:55pm

Crates and jsr.io WASM/JS/TS packages now published, so easier to install etc …

… including a bash script to install the CLI binary if you don’t have a rust toolchain installed.

curl -fsSL https://github.com/citum/citum-core/releases/latest/download/install.sh | sh

Bruce_D_Arcus1 · May 19, 2026, 2:34pm

One recent major piece of under-the-hood work on the codebase has been adding support for forward compatibility.

This is actually tricky, given that the whole point of the implementation is that it’s rigorously type-safe, and that the JSON schemas are generated from this code and should be useful for validation, auto-completion while editing, etc.

So how, then, do you allow an engine to consume unknown configuration options, etc. while preserving the above?

In some ways this becomes easier to answer if we only assume a single implementation. The scope, then, narrows down to effectively a single scenario: a mismatch between the release cadence of downstream consumers, and core code and styles. E.g. there’s a new release that adds a small change, and someone somewhere is now using slightly foreign styles. We don’t want the process to fail in that scenario.

Now, the way this should work is that the processor still runs as expected, but if there’s an unknown enum value, etc., it reports that.
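A minimal sketch of that behavior, assuming a tolerant parse step that keeps what it understands and collects warnings for the rest. This is not the Citum implementation (which is typed Rust); the `Conjunction` enum, the option names, and the `parse_options` function are all hypothetical.

```python
from dataclasses import dataclass, field
from enum import Enum


class Conjunction(Enum):  # hypothetical option type
    AND = "and"
    SYMBOL = "symbol"


# Options this (older) engine version knows about, with their enum types.
KNOWN_OPTIONS = {"conjunction": Conjunction}


@dataclass
class ParsedOptions:
    options: dict = field(default_factory=dict)
    warnings: list = field(default_factory=list)


def parse_options(raw):
    """Parse style options without failing on unknown keys or enum values."""
    result = ParsedOptions()
    for key, value in raw.items():
        enum_type = KNOWN_OPTIONS.get(key)
        if enum_type is None:
            result.warnings.append(f"unknown option '{key}' ignored")
            continue
        try:
            result.options[key] = enum_type(value)
        except ValueError:
            # Value added by a newer release: report it, fall back to defaults.
            result.warnings.append(f"unknown value {value!r} for '{key}'")
    return result


# A style written for a newer release than this engine.
parsed = parse_options({"conjunction": "symbol", "new-feature": "on"})
```

Here `parsed.options` holds the recognized `conjunction` setting and `parsed.warnings` has one entry for `new-feature`, so rendering can proceed while the mismatch is still surfaced to the user.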

I put the provisional note at the top of the spec to indicate my uncertainty about whether this is the right balance. Only real-world testing and usage, I think, will really determine the answer.

If it turns out to be a bad idea, it should be easy enough to remove. Or if it instead requires some adjustments to the scope of that contract, that should also be pretty easy to implement.

Bruce_D_Arcus1 · May 22, 2026, 2:21pm

Along with publishing the crates and WASM + TS/JS bindings, I’ve spent time the past few days enhancing CSL conversion in the citum-migrate tool, and the related LLM skill. The page includes instructions, as well as example prompts for the sort of things it should accommodate.

Bruce_D_Arcus1 · June 6, 2026, 7:32pm

Working on something for integrating with word-processors, which helps further test and refine the API.

Here’s the current state of the citation-insertion UI; note the citation-mode options, and the live preview at the bottom. This is the APA style, and it correctly renders the conjunction as “and” in integral (in-text) citations, and as “&” elsewhere.

Also, this is using the Zotero “provider”, which is caching the Zotero DB via the web API. But the idea is it wouldn’t be tied to any particular reference manager.
