Foma

Foma is a Russian corpus compiled from the online portal of the Russian Orthodox magazine Foma (foma.ru), excluding Q&A content; collected in December 2025.

Source

About the portal (foma.ru)

The corpus source portal foma.ru is part of the Orthodox media project “Foma”, which positions itself as “Orthodox media for those who doubt” and aims to explain Orthodox faith and church life in a modern, accessible language.

Popularity and reach

Editorial and organization

Rubrics and thematic focus

“Foma” describes its specialization as cultural and educational. The publication’s typical focus includes religion, society, culture, history, and related public issues, while it avoids discussing day-to-day secular politics.

Since when online

Significance

Document structure

Linguistic annotation

The corpus is annotated in the Universal Dependencies (UD) framework using UDPipe (lemmatization and UPOS tagging; morphological features in UD FEATS format). The tagger/lemmatizer model is based on UD Russian SynTagRus.

Tagset and annotation notes: corpora.fisun.org/corpus-pages/tagset.html.

Corpus attributes

Positional attributes: word, lemma, pos, morph, lc.

Size (current index)

ItemCount
Tokens29,890,878
Words23,434,351
Sentences (<s>)1,877,821
Documents (<doc>)31,744

Lexicon sizes

AttributeCount
word687,581
lemma374,911
pos17
morph553
lc607,073

Text types (metadata inventory)

The corpus defines <doc> with 9 attributes and <s> with 1 attribute (s.id).

StructureAttributeDistinct values
<doc>doc.author529
<doc>doc.language1
<doc>doc.pubyear5,576
<doc>doc.rubric373
<doc>doc.source1
<doc>doc.status200
<doc>doc.text_id31,744
<doc>doc.title31,647
<doc>doc.url31,744
<s>s.id1

How to cite

Fisun, Roman. 2025. Foma: Russian Orthodox magazine corpus. Compiled from foma.ru (data collected in December 2025). Available at: https://corpora.fisun.org/ (corpus name: foma). Accessed: <YYYY-MM-DD>.

Software

Terms of use

Access to this corpus is restricted (password-protected) and provided on an “as is” basis for research and educational use only. This service does not grant any license or other rights to the underlying texts.

The corpus (including any excerpts, downloads, or derived copies of the original texts) is not freely distributable. Reproduction, redistribution, republication, mirroring, or making the content publicly available is prohibited unless you have explicit permission from the respective rights holders and/or the source website.

Copyright and any other rights in the original texts remain with the respective source website and/or their authors. Users are solely responsible for ensuring that any use complies with applicable law and the terms of the original source.

The maintainer makes no warranties regarding completeness, accuracy, fitness for a particular purpose, or continued availability of the service.

Contact

Maintainer: roman.fisun@ur.de