The point I am making is a very practical one,
namely that when someone asks here in the
forums "How can I stop my accented characters
from looking like Chinese at the other end?" ...
Tom! Please reread the original post & initial replies carefully. This is not an accurate representation of what Felipe asked. Your initial reply acknowledged you addressed only a part of the overall issue ("at least for Win Outlook"). I tried to make it explicitly clear that I was supplying
additional information ("In addition to what the others have said") that might be helpful with understanding the overall issue, not just how it impacted Outlook users.
As the discussion has progressed, I have tried to do more of that, which unfortunately means delving into how the terminology is sometimes used without much regard for accuracy or appropriate context. Regarding the
Glossary of Unicode Terms entry for
Rich Text, the full entry illustrates exactly what I have been talking about:
Rich Text. Also known as styled text. The result of adding information to plain text. Examples of information that can be added include font data, color, formatting information, phonetic annotations, interlinear text, and so on. The Unicode Standard does not address the representation of rich text. It is expected that systems and applications will implement proprietary forms of rich text. Some public forms of rich text are available (for example, ODA, HTML, and SGML). When everything except primary content is removed from rich text, only plain text should remain.
Part of the confusion is because "Rich Text" appears capitalized in headings & in Apple menus, making it difficult to determine if it means the generic, lower case "rich text" term found in carefully worded documentation (including much of Apple's) or to the proprietary "Rich Text Format" of Microsoft. It would be helpful if notes & commentary by users were as careful about this, but I have little hope that will ever happen.
The same goes for "ASCII" which is in fact
only a 7-bit coded character set that (quoting the same Glossary of Unicode Terms), "has been incorrectly used to refer to various 8-bit character encodings that include ASCII characters in the first 128 positions." Similarly, "plain text" properly refers to
any character set, the only requirement being that it be free of other formatting or structural information.
We all occasionally get sloppy about this, using "Plain Text" to mean ASCII or visa versa, or making some careless statement about how using plain text frees us from any concerns of text encoding problems, which it most certainly does not.