Monthly Archives: July 2015

SSSplit Improvements

Introduction As part of my continuing work on Partridge, I’ve been working on improving the sentence splitting capability of SSSplit – the component used to split academic papers from PLosOne and PubMedCentral into separate sentences. Papers arrive in our system as big blocks of text with the occasional diagram, formula or diagram and in order … Continue reading SSSplit Improvements

Posted in demo, improvements, java, PhD, regex, split, sssplit, test, Work Tagged with: , ,

SSSplit Improvements

Introduction As part of my continuing work on Partridge, I’ve been working on improving the sentence splitting capability of SSSplit – the component used to split academic papers from PLosOne and PubMedCentral into separate sentences. Papers arrive in our system as big blocks of text with the occasional diagram, formula or diagram and in order … Continue reading SSSplit Improvements

Posted in demo, improvements, java, PhD, regex, split, sssplit, test, Work Tagged with: , ,