Svetlin Nakov – Research Projects

This page is outdated and is no longer updated.

My Google Scholar Profile

https://scholar.google.bg/citations?user=rVPn80EAAAAJ

My Research Gate Profile

https://www.researchgate.net/profile/Svetlin_Nakov

Open Source Toolkit for Extraction of Cognates and False Friends (TECFF) (September 2009)

The Open Source Toolkit for Extraction of Cognates and False Friends (TECFF) implements the most significant algorithms designed as part of my research for my PhD thesis:

MMEDR – algorithm for measuring weighted orthographic similarity between Bulgarian and Russian words taking into account some linguistically motivated Bulgarian-Russian correspondences (current supports Bulgarian and Russian only)
SemSim – algorithm for measuring semantic similarity between words by searching in Google and analyzing the returned text snippets (currently supports Bulgarian, Russian and English)
CrossSim – algorithm for measuring cross-lingual semantic similarity by searching in Google and analyzing the returned text snippets (currently supports Bulgarian and Russian only)
FFExtract: algorithm for extracting false friends from parallel corpus by determining candidates through MMEDR algorithm and combining statistical and semantic evidence for distinguishing between cognates and false friends (currently supports Bulgarian and Russian only)

The toolkit is implemented in C# and is available as open source software under the MIT license.

[Read more here…]

NakovDocumentSigner (September 2003 – February 2006)

NakovDocumentSigner is a digital document signing framework for Java-based Web applications. It is freeware open-source project intended to provide the Web applications with digital signature functionality. NakovDocumentSigner allows the users to digitally sign and upload files directly from their Web browsers. It consists of a Java-applet for digital signing and a reference Web application for digital signatures and certificates verification.

[Read more here…]

PhD Thesis: “Automatic Extraction of False Friends from Parallel Bilingual Corpus” (March 2007 – April 2010)

Svetlin Nakov’s PhD thesis “Automatic Extraction of False Friends from Parallel Bilingual Corpus” is a scientific research in the area of computational linguistics. It conducts research on the cognates and false friends between Bulgarian and Russian and aims to design innovative algorithms for their automatic extraction. New methods for measuring orthographic and semantic similarity (monolingual and cross-lingual) are proposed and their applications in solving various computational linguistics tasks are demonstrated, particularly for synonyms extraction, distinguishing between cognates and false friends and improving words alignment. A two-step method for automatic extraction of false friends from bi-texts is proposed: at the first step pairs of words with similar orthography are collected from the text and at the second step these pairs are categorized as cognates or false friends on the basis of measuring the cross-lingual semantic similarity between them using the Web as a corpus and by applying statistical techniques accounting their occurrences and co-occurrences in the corresponding sentences in the bi-text.

[Read more here…]

Microsoft .NET Framework Course and Teaching Materials (March 2004 – December 2006)

The project is intended to create a set of teaching materials for teaching a course on Microsoft .NET Framework Programming in Bulgarian language. These materials consist of presentations, lecture materials, exercises and a textbook and are available for free downloading. The whole course is available in the form of e-learning lessons. The project has earned the support of Microsft Reaserch and Sofia University “St. Kliment Ohridski”.

[Read more here…]

ArtsSemNet (November 2003)

ArtsSemNet is an electronic lexical reference system, similar to WordNet, for terminology of fine arts. The terms (over 2,600 for each language) are annotated with complete dictionary definitions and organized into a semantic network with two parallel versions: Bulgarian and Russian. Five important lexical relations are defined: polysemy, synonymy, homonymy, antonymy and hyponymy, the latter serving as the basis of the hierarchical organization of the ontology. In addition, a specialized browser is created thus providing an intuitive interface to query and navigate through the network.

[Read more here…]

Comments (17)

17 Responses to “Svetlin Nakov – Research Projects”

Evan West says:

December 3, 2024 at 07:41

It’s really what I need now!
geometry dash

Reply
Evan West says:

December 3, 2024 at 07:43

It’s really what I need now!
geometry dash

Reply
solar smash says:

March 4, 2025 at 07:47

It’s wonderful

Reply
Space Waves says:

June 18, 2025 at 11:34

great post

Reply
prankpayment mod app 2025 says:

July 11, 2025 at 09:59

Generate realistic fake receipts and payment confirmations in just a few taps using Prank Payment.

Reply
treewall says:

October 17, 2025 at 16:49

an incredibly well written research project i have read many but this is something else

Reply
- Guyana news says:
  
  April 15, 2026 at 01:31
  
  Stay informed with Guyana News—your source for breaking updates,trending topics, and insightful stories from Guyana and beyond.
  
  Reply
- CONEXO says:
  
  April 15, 2026 at 01:36
  
  Play the daily word connection game at CONEXO. Group 16 words into 4 hidden categories, earn points, build your streak, and solve easy to tricky puzzles every midnight.
  
  Reply
Recovery Hands says:

October 18, 2025 at 17:51

very nice

Reply
- septle says:
  
  April 15, 2026 at 01:32
  
  Septle is a unique seven-letter word game inspired by Wordle. Guess the hidden 7-letter word in 8 tries, and check the answer for a fun reveal
  
  Reply
Traffic Rider: The Ultimate Motorcycle Racing Game - Trend Burst says:

November 1, 2025 at 09:31

[…] https://nakov.com/research/#comment-476137 […]

Reply
Anoboy says:

January 13, 2026 at 10:14

Welcome to Anoboy One Piece Official Download Nonton Online Streaming Anime Subtitle Indonesia Kualitas Tinggi tersedia 240P 360P 480P 720P.

Reply
golf hit says:

February 2, 2026 at 08:29

Golf Hit redefines the classic gentleman’s game, stripping away the slow pace of traditional country clubs and replacing it with a high-speed, “orbital” arcade challenge

Reply
- Contexto says:
  
  April 15, 2026 at 01:33
  
  Contexto is a daily word-guessing game that challenges players to identify a secret target word by thinking critically about meaning rather than spelling.
  
  Reply
medmeds says:

April 18, 2026 at 13:40

Are you in need of authentic medications on the Internet? Check out medmeds and get real medications, competitive pricing, and speedy delivery. Enjoy a hassle-free and safe medical shopping experience without ever having to leave your house.

Reply
Z Image Turbo says:

April 27, 2026 at 18:32

I found the research projects overview very informative, especially the detailed descriptions of the computational linguistics and extraction algorithms. For anyone working with AI models and image generation, I also found this Z Image Turbo app really useful for quick visual prototyping.

Reply
Kebede Boqorada says:

May 4, 2026 at 03:21

**Interesting to see how TECFF has evolved since those early 2000s PhD algorithms. The shift from Java-based document signing to open-source C# cognate extraction really shows the breadth of research in NLP toolkits. Would love to know if any of those original algorithms are still relevant for modern cross-lingual similarity tasks.**

Reply