You can make a difference in the Apple Support Community!

When you sign up with your Apple Account, you can provide valuable feedback to other community members by upvoting helpful replies and User Tips.

Looks like no one’s replied in a while. To start the conversation again, simply ask a new question.

Why doesn't Cyrillic (Russian, Bulgarian, Serbian etc.) OO-based spell check work?

Hi!


The spell-check in OS X has official support for Russian, but it doesn't for Bulgarian, Serbian, Macedonian etc. One can of course use dictionaries from Open Office, install them, and in theory everything should be fine.


I find it that none of the OO-based dictionaries for Cyrillic languages work. No words get underlined -- whether they are spelled correctly or incorrectly. I also installed the Open Office dicitonary for German and it worked without a hitch.


If you are users of Cyrillic languages and if you have added your own .dic and .aff files to your Mac, could you please confirm that everything is working fine for you?


(And, yes, my Cyrillic dictionaries show up in the language preferences and they are all enabled.)


Any tips will be greatly appreciated.


All best

Tench

Mac Pro, OS X Mountain Lion

Posted on Jul 31, 2013 5:51 AM

Reply
Question marked as Top-ranking reply

Posted on Dec 29, 2013 1:01 PM

@tenchpsych


Hi there,

The problem with Open Office Dictionaries for Slavic, cyrilic languages, is the encoding CP1251 which OS X does not like and will not combine with its native Western-latin encoding. However OS X likes encoding UTF-8 and if you follow the steps below you will have a working cyrilic spell check-text-as you type, in OS X system wide.


I just noticed that the sr-SR package is already UTF-8 so you can use it out of the box.

However the Macedonian, Bulgarian and Russian package are non UTF-8.



To make OO non-UTF8 dictionaries work, download them from the Open Office website (dictionary Extentions), change the .oxt extention into .zip. Unpack them, and browse the unpacked folder. You will only need the following 2 files (depanding which language you download, e.g. mk_MK.dic and mk_MK.aff (or bg_bg, ru_ru dic and aff)


Now here is the catch: To convert CP1251 you need a converter, there are a couple of them online (google Convert cp-1251 into UTF-8), otherwise if you have Windows, open new Notepad document - copy-paste the text and save as. in the dropdown menu as UTF-8)


If you are doing it in OS X open both files in TextEdit, select all, copy the unreadable text and paste it in the online converter. Hit convert and copy and paste it back in the .dic and .aff files, respectively. Don't forget to change the encoding command at the top of the .aff file it should read at the top * SET UTF-8 * . Save and close both files. Copy both files into /System/Library/Spelling and restart OS X.


Now you have a system wide spellcheck in cyrilic, however it will not work if you choose the Multilanguage option, i.e. in TextEdit to check the spelling (for e.g. MK, SR or BG) you must go to Edit/SpellingandGrammar/ShowSpellingAndGrammar and Switch the language mannualy. I repeat Automatic by Language will not work for Latin-Cyrilic combination.

These are not very sophisticated spellchecers and may easily miss some mistakes, thus double-check your spelling.

3 replies
Question marked as Top-ranking reply

Dec 29, 2013 1:01 PM in response to tenchpsych

@tenchpsych


Hi there,

The problem with Open Office Dictionaries for Slavic, cyrilic languages, is the encoding CP1251 which OS X does not like and will not combine with its native Western-latin encoding. However OS X likes encoding UTF-8 and if you follow the steps below you will have a working cyrilic spell check-text-as you type, in OS X system wide.


I just noticed that the sr-SR package is already UTF-8 so you can use it out of the box.

However the Macedonian, Bulgarian and Russian package are non UTF-8.



To make OO non-UTF8 dictionaries work, download them from the Open Office website (dictionary Extentions), change the .oxt extention into .zip. Unpack them, and browse the unpacked folder. You will only need the following 2 files (depanding which language you download, e.g. mk_MK.dic and mk_MK.aff (or bg_bg, ru_ru dic and aff)


Now here is the catch: To convert CP1251 you need a converter, there are a couple of them online (google Convert cp-1251 into UTF-8), otherwise if you have Windows, open new Notepad document - copy-paste the text and save as. in the dropdown menu as UTF-8)


If you are doing it in OS X open both files in TextEdit, select all, copy the unreadable text and paste it in the online converter. Hit convert and copy and paste it back in the .dic and .aff files, respectively. Don't forget to change the encoding command at the top of the .aff file it should read at the top * SET UTF-8 * . Save and close both files. Copy both files into /System/Library/Spelling and restart OS X.


Now you have a system wide spellcheck in cyrilic, however it will not work if you choose the Multilanguage option, i.e. in TextEdit to check the spelling (for e.g. MK, SR or BG) you must go to Edit/SpellingandGrammar/ShowSpellingAndGrammar and Switch the language mannualy. I repeat Automatic by Language will not work for Latin-Cyrilic combination.

These are not very sophisticated spellchecers and may easily miss some mistakes, thus double-check your spelling.

Dec 29, 2013 3:19 PM in response to vele_1

EDIT:


There is a way in TextEdit to change encoding.


For Macedonian and Bulgarian:


1. Open TextEdit goto Preferences/OpenAndSave in Opening Files goto Custemize Encoding List and tick Cyrilic Windows and Cyrilic (KOI-8R). Then choose from the menu list Cyrilic (windows)


Then open the dic and aff files (mk_MK and bg_BG). The text should be readble Cyrilic. In File make Duplicate. Then Save and choose at the bottom UTF-8. Change the file names to their originals mk_MK and bg_BG.


For Russian:

Repeat step 1 but instead choose Cyrilic (KOI-8R) (ru_RU.dic and ru_RU.aff).

!The Russian open office dictionary is in KOI-8


For Serbian no need to do anything, as the files are already UTF-8.

Why doesn't Cyrillic (Russian, Bulgarian, Serbian etc.) OO-based spell check work?

Welcome to Apple Support Community
A forum where Apple customers help each other with their products. Get started with your Apple Account.