Shuttersock’s engineering blog Shutterbits features an interview with me by Dan McCormick titled When a Space Is Not Just a Space. We discuss the interesting complexities of Unicode whitespace and how to parse it using regular expressions, with examples in Java and Perl.


Nova Patch (@novapatch) is a principal engineer at Shutterstock, specializing in internationalization, multilingual search, and building products that support the world’s languages, writing systems, and cultures. They are an open source developer, contributor to the Unicode CLDR, and member of the Unicode Consortium.