Protégé is utf-8 compatible, which means it can process and display utf-8 characters. In the course of editing, many editors prepare their concepts or concept information in Microsoft Word or Excel. They then copy from Microsoft and paste into Protégé. This can cause problems because Microsoft is not purely utf-8 compatible. The paste operation can introduce characters that Protégé does not know how to process. The instructions below show how to avoid these problems.
Thanks to Interwingly for providing this table.
If you are running on a Microsoft platform, or cut and paste from documents produced by Microsoft software, or even allow comments to be posted by people who might be doing one of the above, you need to be aware of the 27 differences, summarized by the following table.
character |
win-1252 decimal |
win-1252 hex |
win-1252 octal |
unicode html |
unicode xml |
unicode url |
||
---|---|---|---|---|---|---|---|---|
€ |
128 |
80 |
200 |
|
|
%E2%82%AC |
||
‚ |
130 |
82 |
202 |
|
|
%E2%80%9A |
||
ƒ |
131 |
83 |
203 |
|
|
%C6%92 |
||
„ |
132 |
84 |
204 |
|
|
%E2%80%9E |
||
… |
133 |
85 |
205 |
|
|
%E2%80%A6 |
||
† |
134 |
86 |
206 |
|
|
%E2%80%A0 |
||
‡ |
135 |
87 |
207 |
|
|
%E2%80%A1 |
||
ˆ |
136 |
88 |
210 |
|
|
%CB%86 |
||
‰ |
137 |
89 |
211 |
|
|
%E2%80%B0 |
||
Š |
138 |
8A |
212 |
|
|
%C5%A0 |
||
‹ |
139 |
8B |
213 |
|
|
%E2%80%B9 |
||
Œ |
140 |
8C |
214 |
|
|
%C5%92 |
||
Ž |
142 |
8E |
216 |
|
|
%C5%BD |
||
‘ |
145 |
91 |
221 |
|
|
%E2%80%98 |
||
’ |
146 |
92 |
222 |
|
|
%E2%80%99 |
||
|
|
|
|
<code>&</code> |
& |
|
||
|
|
|
|
& |
& |
|
||
|
|
|
|
& |
& |
|
||
|
|
|
|
& |
& |
|
||
|
|
|
|
& |
& |
|
||
|
|
|
|
& |
& |
|
||
|
|
|
|
& |
& |
|
||
|
|
|
|
& |
& |
|
||
|
|
|
|
& |
& |
|
||
|
|
|
|
& |
& |
|
||
|
|
|
|
& |
& |
|
||
|
|
|
|
& |
& |
|
||
|
|
|
|
& |
& |
|
||
|
|
|
|
& |
& |
|
Thanks to Liverpool John Mores University for providing the following instructions:
For Word and Excel 2007, the instructions are the same