From Misha.Wolf@reuters.com Mon Dec 17 19:15:14 2001 From: Misha.Wolf@reuters.com (Misha.Wolf@reuters.com) Date: Mon, 17 Dec 2001 19:15:14 +0000 Subject: [I18n-sig] 20th Unicode Conference, Jan 2002, Washington DC -- Six weeks to go! Message-ID: >>>>>>>>>>>>>>>>>>>>>>>> Just 6 weeks to go! <<<<<<<<<<<<<<<<<<<<<<<< Twentieth International Unicode Conference (IUC20) Unicode and the Web: The Global Connection http://www.unicode.org/iuc/iuc20 January 28-31, 2002 Washington, DC, USA >>>>>>>>>>>>>>>>>>>>>>>>>>> Register now! <<<<<<<<<<<<<<<<<<<<<<<<<<< NEWS * Hotel guest room group rate valid to January 3. * Early bird registration rate valid to January 11. * Visit the Conference Web site ( http://www.unicode.org/iuc/iuc20 ) to check the updated Conference program and register. To help you choose Conference sessions, we've included abstracts of talks and speakers' biographies. * The World Wide Web Consortium (W3C) Internationalization Workshop is taking place in the same venue, on February 1 -- See the Call for Participation ( http://www.w3.org/2002/02/01-i18n-workshop/cfp ) CONFERENCE SPONSORS Agfa Monotype Corporation Basis Technology Corporation Microsoft Corporation Netscape Communications Oracle Corporation Progress Software Corporation Reuters Ltd. Sun Microsystems, Inc. World Bank World Wide Web Consortium (W3C) CONFERENCE VENUE Omni Shoreham Hotel 2500 Calvert Street, NW Washington, DC 20008 USA Tel: +1 202 234 0700 Fax: +1 202 265 7972 GLOBAL COMPUTING SHOWCASE Visit the Showcase to find out more about products supporting the Unicode Standard, and products and services that can help you globalize/localize your software, documentation and Internet content. For details, visit the Conference Web site: http://www.unicode.org/iuc/iuc20 Exhibitors to date include: * Agfa/Monotype Corporation * Multilingual Computing, Inc. * Rasmussen Software, Inc. CONFERENCE MANAGEMENT Global Meeting Services Inc. 8949 Lombard Place #416 San Diego, CA 92122, USA Tel: +1 858 638 0206 (voice) +1 858 638 0504 (fax) Email: info@global-conference.com or: conference@unicode.org * * * * * Unicode(r) and the Unicode logo are registered trademarks of Unicode, Inc. Used with permission. ------------------------------------------------------------- --- Visit our Internet site at http://www.reuters.com Any views expressed in this message are those of the individual sender, except where the sender specifically states them to be the views of Reuters Ltd. From Misha.Wolf@reuters.com Thu Dec 20 15:06:08 2001 From: Misha.Wolf@reuters.com (Misha.Wolf@reuters.com) Date: Thu, 20 Dec 2001 15:06:08 +0000 Subject: [I18n-sig] Character Model for the World Wide Web Message-ID: I'm very pleased to be able to announce the publication of a new Working Draft of the Character Model for the World Wide Web: http://www.w3.org/TR/charmod/ An extract from the document follows: Abstract This Architectural Specification provides authors of specifications, software developers, and content developers with a common reference for interoperable text manipulation on the World Wide Web. Topics addressed include encoding identification, early uniform normalization, string identity matching, string indexing, and URI conventions, building on the Universal Character Set, defined jointly by Unicode and ISO/IEC 10646. Some introductory material on characters and character encodings is also provided. Status of this Document This section describes the status of this document at the time of its publication. Other documents may supersede this document. The latest status of this series of documents is maintained at the W3C. This is a W3C Working Draft published between the first Last Call Working Draft of 26 January 2001 and a planned second Last Call. This interim publication is used to document the further progress made on addressing the comments received during the first Last Call. A list of last call comments with their status can be found in the disposition of comments (Members only). Work is still ongoing on addressing the comments received during the first Last Call. We do not encourage comments on this Working Draft; instead we ask reviewers to wait for the second Last Call. We will announce the second Last Call on the W3C Internationalization public mailing list (www-international@w3.org; subscribe). Comments from the public and from organizations outside the W3C may be sent to www-i18n-comments@w3.org (archive). Comments from W3C Working Groups may be sent directly to the Internationalization Interest Group (w3c-i18n-ig@w3.org), with cross-posting to the originating Group, to facilitate discussion and resolution. Due to the architectural nature of this document, it affects a large number of W3C Working Groups, but also software developers, content developers, and writers and users of specifications outside the W3C that have to interface with W3C specifications. This document is published as part of the W3C Internationalization Activity by the Internationalization Working Group (Members only), with the help of the Internationalization Interest Group. The Internationalization Working Group will not allow early implementation to constrain its ability to make changes to this specification prior to final release. Publication as a Working Draft does not imply endorsement by the W3C Membership. It is inappropriate to use W3C Working Drafts as reference material or to cite them as other than "work in progress". A list of current W3C Recommendations and other technical documents can be found at http://www.w3.org/TR. For information about the requirements that informed the development of important parts of this specification, see Requirements for String Identity Matching and String Indexing [CharReq]. Misha Wolf W3C I18N WG Chair -------------------------------------------------------------- -- Visit our Internet site at http://www.reuters.com Any views expressed in this message are those of the individual sender, except where the sender specifically states them to be the views of Reuters Ltd. From mal@lemburg.com Thu Dec 20 19:15:51 2001 From: mal@lemburg.com (M.-A. Lemburg) Date: Thu, 20 Dec 2001 20:15:51 +0100 Subject: [I18n-sig] Character Model for the World Wide Web References: Message-ID: <3C2238E7.5ABE45F8@lemburg.com> Misha.Wolf@reuters.com wrote: > > I'm very pleased to be able to announce the publication of a new Working > Draft of the Character Model for the World Wide Web: > http://www.w3.org/TR/charmod/ > > An extract from the document follows: > > Abstract > > This Architectural Specification provides authors of specifications, > software developers, and content developers with a common reference for > interoperable text manipulation on the World Wide Web. Topics addressed > include encoding identification, early uniform normalization, string > identity matching, string indexing, and URI conventions, building on the > Universal Character Set, defined jointly by Unicode and ISO/IEC 10646. > Some introductory material on characters and character encodings is also > provided. Looks like we'll need Unicode normalization support in Python soon in order to reach at least some compatibility with this proposed standard. -- Marc-Andre Lemburg CEO eGenix.com Software GmbH ______________________________________________________________________ Company & Consulting: http://www.egenix.com/ Python Software: http://www.egenix.com/files/python/ From tree@basistech.com Thu Dec 20 19:24:22 2001 From: tree@basistech.com (Tom Emerson) Date: Thu, 20 Dec 2001 14:24:22 -0500 Subject: [I18n-sig] Character Model for the World Wide Web In-Reply-To: <3C2238E7.5ABE45F8@lemburg.com> References: <3C2238E7.5ABE45F8@lemburg.com> Message-ID: <15394.15078.986641.940056@magrathea.basistech.com> M.-A. Lemburg writes: > Looks like we'll need Unicode normalization support in Python > soon in order to reach at least some compatibility with this > proposed standard. I've already started implementing normalization support. If anyone else has also started let me know. -tree -- Tom Emerson Basis Technology Corp. Sr. Computational Linguist http://www.basistech.com "Beware the lollipop of mediocrity: lick it once and you suck forever" From Misha.Wolf@reuters.com Thu Dec 20 20:03:44 2001 From: Misha.Wolf@reuters.com (Misha.Wolf@reuters.com) Date: Thu, 20 Dec 2001 20:03:44 +0000 Subject: [I18n-sig] Call for Papers - 21st Unicode Conference - May 2002 - Dublin Message-ID: Twenty-First International Unicode Conference (IUC21) Unicode and the Web: The Global Connection http://www.unicode.org/iuc/iuc21 May 14-17, 2002 Dublin, Ireland > > > > > > > C A L L F O R P A P E R S < < < < < < < Submissions due: January 11, 2002 Notification date: February 1, 2002 Completed papers due : February 22, 2002 (in electronic form and camera-ready paper form) * * * * * The Unicode Standard has become the foundation for all modern text processing. It is used on large machines, tiny portable devices, and for distributed processing across the Internet. The standard brings cost-reducing efficiency to international applications and enables the exchange of text in an ever increasing list of natural languages. New technologies and innovative Internet applications, as well as the evolving Unicode Standard, bring new challenges along with their new capabilities. This technical conference will explore the opportunities created by the latest advances and how to leverage them, as well as potential pitfalls to be aware of, and problem areas that need further research. We invite you to submit papers which either define the software of tomorrow, demonstrate best practice with today's software, or articulate problems that must be solved before further advances can occur. Papers should discuss subjects in the context of Unicode, internationalization or localization. You can view the programs of previous conferences at: http://www.unicode.org/unicode/conference/about-conf.html Conference attendees are generally involved in either the development, deployment or use of Unicode software or content, or the globalization of software and the Internet. They include managers, software engineers, systems analysts, font designers, graphic designers, content developers, technical writers, and product marketing personnel. THEME & TOPICS Computing with Unicode is the overall theme of the Conference. Presentations should be geared towards a technical audience. Topics of interest include, but are not limited to, the following (within the context of Unicode, internationalization or localization): - UTFs: Not enough or too many? - Security concerns e.g. Avoiding the spoofing of UTF-8 data - Impact of new encoding standards - Implementing Unicode: Practical and political hurdles - Portable devices - Implementing new features of recent versions of Unicode - Algorithms (e.g. normalization, collation, bidirectional) - Programming languages and libraries (Java, Perl, et al) - The World Wide Web (WWW) - Search engines - Library and archival concerns - Operating systems - Databases - Large scale networks - Government applications - Evaluations (case studies, usability studies) - Natural language processing - Migrating legacy applications - Cross platform issues - Printing and imaging - Optimizing performance of systems and applications - Testing applications - XML and Web protocols - Business models for software development (e.g. Open source) SESSIONS The Conference Program will provide a wide range of sessions including: - Keynote presentations - Workshops/Tutorials - Technical presentations - Panel sessions All sessions except the Workshops/Tutorials will be of 40 minute duration. In some cases, two consecutive 40 minute program slots may be devoted to a single session. The Workshops/Tutorials will each last approximately three hours. They should be designed to stimulate discussion and participation, using slides and demonstrations. PUBLICITY If your paper is accepted, your details will be included in the Conference brochure and Web pages and the paper itself will appear on a Conference CD, with an optional printed book of Conference Proceedings. CONFERENCE LANGUAGE The Conference language is English. All submissions, papers and presentations should be provided in English. SUBMISSIONS Submissions MUST contain: 1. An abstract of 150-250 words, consisting of statement of purpose, paper description, and your conclusions or final summary. 2. A brief biography. 3. The details listed below: SESSION TITLE: _________________________________________ _________________________________________ TITLE (eg Dr/Mr/Mrs/Ms): _________________________________________ NAME: _________________________________________ JOB TITLE: _________________________________________ ORGANIZATION/AFFILIATION: _________________________________________ ORGANIZATION'S WWW URL: _________________________________________ OWN WWW URL: _________________________________________ ADDRESS FOR PAPER MAIL: _________________________________________ _________________________________________ _________________________________________ TELEPHONE: _________________________________________ FAX: _________________________________________ E-MAIL ADDRESS: _________________________________________ TYPE OF SESSION: [ ] Keynote presentation [ ] Workshop/Tutorial [ ] Technical presentation [ ] Panel PANELISTS (if Panel): _________________________________________ _________________________________________ _________________________________________ _________________________________________ _________________________________________ _________________________________________ _________________________________________ _________________________________________ TARGET AUDIENCE (you may select more than one category): [ ] Content Developers [ ] Font Designers [ ] Graphic Designers [ ] Managers [ ] Marketers [ ] Software Engineers [ ] Systems Analysts [ ] Technical Writers [ ] Others (please specify): _________________________________________ _________________________________________ LEVEL OF SESSION (you may select more than one category): [ ] Beginner [ ] Intermediate [ ] Advanced Submissions should be sent by e-mail to either of the following addresses: papers@unicode.org info@global-conference.com They should use ASCII, non-compressed text and the following subject line: Proposal for IUC 21 If desired, a copy of the submission may also be sent by post to: 21st International Unicode Conference c/o Global Meeting Services, Inc. 8949 Lombard Place #416 San Diego, CA 92122 USA Tel: +1 858 638 0206 Fax: +1 858 638 0504 CONFERENCE PROCEEDINGS All Conference papers will be published on CD. Printed proceedings will be offered as an option. EXHIBIT OPPORTUNITIES The Conference will have an Exhibition area for corporations or individuals who wish to display and promote their products, technology and/or services. Every effort will be made to provide maximum exposure and advertising. Exhibit space is limited. For further information or to reserve a place, please contact Global Meeting Services at the above location. CONFERENCE VENUE The Burlington Hotel Upper Leeson Street Dublin 4 Ireland Tel: +353 1 660 5222 Fax: +353 1 660 8496 THE UNICODE CONSORTIUM The Unicode Consortium was founded as a non-profit organization in 1991. It is dedicated to the development, maintenance and promotion of The Unicode Standard, a worldwide character encoding. The Unicode Standard encodes the characters of the world's principal scripts and languages, and is code-for-code identical to the international standard ISO/IEC 10646. In addition to cooperating with ISO on the future development of ISO/IEC 10646, the Consortium is responsible for providing character properties and algorithms for use in implementations. Today the membership base of the Unicode Consortium includes major computer corporations, software producers, database vendors, research institutions, international agencies and various user groups. For further information on the Unicode Standard, visit the Unicode Web site at http://www.unicode.org or e-mail * * * * * Unicode(r) and the Unicode logo are registered trademarks of Unicode, Inc. Used with permission. ------------------------------------------------------------- --- Visit our Internet site at http://www.reuters.com Any views expressed in this message are those of the individual sender, except where the sender specifically states them to be the views of Reuters Ltd.