Hi,
I pushed a few patches that add disambiguation and citation sorting
support.
Moreover pandoc (the svn tree) can now be compiled with citeproc
support without patches (by just running configure with the “-f
citeproc” flag).
The disambiguation bits are supposed to be a real implementation, and
should be supporting every syntactically correct style.
With a major exception: it doesn’t (and will not) support the
"add-title" options (I’ll write a second message for it, since I’m
going to propose to get rid of it).
There are minor issues, which I think are related to the semantics of
those disambiguation options: if a style has short names (or long with
"initialize-with") and “add-given-names” is not present, two list of
authors with the same last name and different given-names (or
initials) will not received a year suffix, since the author is not the
same - (given names and possibly initials are different).
Or, for instance, the apa style has long names with initials in the
bibliography, but has the “add-given-names” as a citation option. Now,
if two authors share names and initials, the citation will be
disambiguated with the given-name, but the bibliography will not, so
you can distinguish the citations but you cannot understand which
citation refers to which bibliographic entry. See below an example.
Here you’ll find a test suite, built upon the zotero issues examples:
http://gorgias.mine.nu/csl/disambig/
For instance, if you run:
pandoc --mods disamb.mods --csl apa.csl disamb.markdown
you will get:
Giovanni Pascuzzi, 2004; Giuseppe Pascuzzi, 2004;
when citing Pascuzzi2004a and Pascuzzi2004b, but the bibliography is:
Pascuzzi, G. (2004). The Brother’s Book.
Pascuzzi, G. (2004). First Book.
See the output here:
http://gorgias.mine.nu/csl/disambig/apa_test.html
The test suite is this one:
pandoc --mods disamb.mods --csl disamb.csl disamb.markdown
and this is the output:
http://gorgias.mine.nu/csl/disambig/test_suite.html
I think this is the way of adding the title (as the Modern Language
Association (mla) style does).
Performance: it is an issue indeed, but I think my approach is overall
efficient.
In this file:
http://gorgias.mine.nu/csl/disambig/disamb_performance.markdown
there are about 800 citations of the 8 colliding references of the
disamb.mods collection:
[14:15:25]$ time pandoc --mods disamb.mods --csl apa.csl disamb_performance.markdown > /dev/null
real 0m2.539s
user 0m2.412s
sys 0m0.036s
This is not bad, after all.
Nonetheless I’m really looking forward to seeing other implementations
for possible improvements.
Collapsing is the only major missing features - together with some
minor formatting options, to be taken care together within pandoc,
mostly. Which means citeproc-hs can now have some broader testing and
debugging.
Please let me know about any issue you may find.
Thanks,
Andrea