For information on exhibition and sponsorship opportunities at the convention, contact Sharon Cordesse at firstname.lastname@example.org
Download the OSCON Sponsor/Exhibitor Prospectus
View a complete list of OSCON contacts
Attendee prerequisites for this tutorial are listed below.
Growing exponentially over the last decade, Unicode text now comprises over 95% of the documents retrieved over the web, while in other collections, it is often 100% Unicode. This tutorial shows Perl programmers how to manage Unicode data.
Key differences between bytes and characters affect numerous basic Perl string operations. Automatic encoding and decoding between multiple external formats make dealing with Unicode I/O easy. Your programs can be written with literal UTF-8 strings and even identifiers, but alternate and sometimes more convenient ways of expressing Unicode code points symbolically in your strings are available. Normalization of combined characters makes string comparisons trickier with Unicode. The old sort function doesn’t work well on Unicode text; instead, locale-specific sorting algorithms take its place. Unicode pattern-matching is only lightly discussed, because it’s a big enough topic to merit a tutorial of its own.
This tutorial has certain workstation requirements and pre-requisites, including software installations required prior to attending. Click HERE to download the file.
QUESTIONS for the speaker?: Use the “Leave a Comment or Question” section at the bottom to address them.
Tom Christiansen is a programmer, author, and lecturer who’s been
involved with Perl since its initial public release back in 1987. Tom is
the owner of the PERL.COM domain and website, and original author of much
of Perl’s online documentation. Tom is lead author of the
The Perl Cookbook and co-author of Programming Perl, Learning Perl
(2nd edition), and Learning Perl on Win32 Systems, all bestselling titles by
O’Reilly & Associates.
He served two terms on the USENIX Association Board of Directors, and was
president of The Perl Journal. Perl users selected Tom to receive the
first White Camel Award in 1999. In 2000, Members of the Open Source
community voted Tom Best Newbie Helper in the first annual Andover.Net
Slashdot Open Source Community Awards, to honor Open Source pioneers.
Tom holds a Masters degree in Computer Science from the University of
Wisconsin – Madison with a dual specialization in operating systems design
and in computational linguistics. He previously received his Bachelors
degree there in Spanish and Computer Science with minor fields of study in
French, Mathematics, and Music. Tom has lived abroad in England and in
Spain, where he studied Romance Philology, café solo, and vino tinto.
Residing at the western edge of Boulder, Colorado, Tom is an amateur
naturalist who spends most of his summer hiking and camping high in the
wilderness well above 10,000 feet of elevation, wandering about the vast
Colorado Plateau, or relaxing under the glittering kaleidoscope of the
Black Rock Desert’s starkly featureless playa. Over the past five years,
Tom has become especially interested in how the exciting growth of
affordable digital photography has opened up to mere mortals dramatic
artistic opportunities previously possible to only the most dedicated
and persistent of professional photographers, and often not even to them.
Comments on this page are now closed.