Menu

Nakov.com logo

Thoughts on Software Engineering

Svetlin Nakov was Awarded with a PhD Degree in Informatics

Today Svetlin Nakov defended successfully his PhD thesis titled “Automatic Extraction of False Friends from Parallel Bilingual Corpus” and was awarded with the scientific and educational degree “Doctor of Philosopy” (PhD) in Informatics in the area of computational linguistics.

Svetlin Nakov - PhD - defense

The thesis was defended according to the Bulgarian law, in front of the Specialized Scientific Council in Informatics and Mathematical Modeling of the Higher Attestaion Commission of the Bulgarian Academy of Sciences (BAS). Unlike Western Europe and USA in Bulgaria PhD degree is given by national specialized scientific council consisting of about 20 distinguished scientists.

Svetlin Nakov - PhD thesis defense - discussions

The Start

I started work on my PhD thesis in 2007 after I changed my research area to computational linguistics. Initially it was hard to find a research topic which was not well researched and where open questions exist that could be approached. With the help of my research advisor Prof. Paskaleva and by the help of external consultants we found an interesting research topic: false friends. It was relatively easy to research and develop new algorithms for extracting false friends, especially for Bulgarian and Russian due to the fact that cognates and false friends in this particular pair of languages was never been researched by computational linguists. Additionally the idea to use the Web as a corpus was just started to get popular approach for natural language processing and information retrieval.

False Friends – Definition

False friends are words in different languages that are similar spelling and are perceived as similar but have different meanings. For example Bulgarian word “стар” which means “old” and is pronounced [star] and the English word “star” are false friends. They have exactly the same pronunciation but have entirely different meanings.

Automatic Extraction of Cognates and False Friends from Parallel Bilingual Corpus – Abstract

The PhD thesis “Automatic Extraction of Cognates and False Friends from Parallel Bilingual Corpus” conducts research about cognates and false friends between Bulgarian and Russian and proposes algorithms for their extraction. New methods for measuring orthographic and semantic similarity (monolingual and cross-lingual) are proposed and their applications in solving various computational linguistics tasks are demonstrated, particularly for synonyms extraction, distinguishing between cognates and false friends and improving words alignment. A two-step method for automatic extraction of false friends from bi-texts is proposed: at the first step pairs of words with similar orthography are collected from the text and at the second step these pairs are categorized as cognates or false friends on the basis of measuring the cross-lingual semantic similarity between them using the Web as a corpus and by applying statistical techniques accounting their occurrences and co-occurrences in the corresponding sentences in the bi-text.

Scientific Research and Publications

During my work as PhD student I managed to publish 7 scientific papers related to my PhD thesis (as author or co-author):

  • Nakov P., Nakov S., Paskaleva E. “Improved Word Alignments Using the Web as a Corpus”, Proceedings of International Conference “Recent Advances in Natural Language Processing” (RANLP 2007), pages 400-405, Borovets, Bulgaria, 2007
  • Nakov S., Nakov P., Paskaleva E. “Cognate or False Friend? Ask the Web!”, Proceedings of the 1st International Workshop on Acquisition and Management of Multilingual Lexicons, held in conjunction with RANLP 2007, pages 55–62, Borovets, Bulgaria, 2007
  • Nakov S. “Automatic Acquisition of Synonyms Using the Web as a Corpus”. Proceedings of the 3rd Annual South-East European Doctoral Student Conference (DSC 2008), Volume 2, pages 216-229, Thessaloniki, Greece, 2008
  • Nakov S. “Measuring Cross-Lingual Semantic Similarity by Searching in Google”. Proceedings of the 5th International Conference “The Language: A Phenomenon without Frontiers”, ISBN 978-954-9685-43-5, pages 238-242, Varna, Bulgaria, 2008
  • Nakov S. “Automatic Identification of False Friends in Parallel Corpora: Statistical and Semantic Approach”, Serdica Journal of Computing, issue 3, pages 133-158, 2009
  • Nakov S., Nakov P., Paskaleva E. “Unsupervised Extraction of False Friends from Parallel Bi-Texts Using the Web as a Corpus”, Proceedings of International Conference “Recent Advances in Natural Language Processing” (RANLP 2009), pages 292-298, Borovets, Bulgaria, 2009
  • Nakov S., Paskaleva E., Nakov P. “A Knowledge-Rich Approach to Measuring the Similarity between Bulgarian and Russian Words”, Workshop on Multilingual Resources, Technologies and Evaluation for Central and Eastern European Languages held in conjuction with RANLP 2009, Borovets, Bulgaria, 2009

Regardless of the fact that most of my publications were made in Bulgaria, these are published in prestigious conferences like RANLP which is ranked in the top 5 conferences in computational linguistics in the world. It is notable that most of the distinguishing authors cited in my papers attend the RANLP conference.

In the beginning it was new to me how to write high-quality scientific papers that will be accepted with high probability in distinguishing conferences in computational linguistics but I got solid help from my co-authors and my scientific advisor. Initially I believed that it is more complex to invent a new concept, method, framework, theorem, formula or algorithm and to obtain valuable scientific results than to publish them as paper. I found that this assumption is not exactly true – sometimes it takes more time and effort to publish the scientific results than to obtain them.

Lessons Learned

What I learned from my PhD is how to conduct scientific research: how to perform scientific experiments, how to evaluate the obtained results, how to draw motivated conclusions and how to publish the results in a way that will make the reviewers happy. I learned how to write scientific papers with ease: how to structure their content, how to state the proposed ideas as motivated extension of the most recently published scientific achievements (related work), how to present the experiments, how to describe the obtained results in short but clear manner and how to cite related publications. It was nice experience and now I know when I am reading an article whether it is low-quality marketing text or well motivated scientific work.

Graduating PhD Means Really Hard Work!

I started my PhD just as a natural continuation of my high education. I graduated bachelor and masters degrees with excellent results and with ease and I thought PhD will also be easy, but it was different, entirely different.

When I started my PhD work I was author of 4 books and had solid experience as software engineer and trainer so I believed I am good writer and developer and engineer and will cope with the PhD challenge with ease. But conducting scientific research is different. It is not just an application of existing knowledge to solve a specific problem or successfully deliver a software project. It is about inventing new concepts, methods and algorithms, not previously know to anybody. It is about researching open problems, about inventing and experimenting new methods for approaching them and about finding new algorithms and formulating new concepts that could not be found in any book or publication.

PhD Thesis == 5-10 Times * Master’s Thesis

I needed about a month of active work to write my Master’s Thesis. Most people invest similar amount of time and effort for theirs. To prepare and defend a PhD Thesis I needed 5-10 times more effort, time and work. Publishing a valuable research paper could take few weeks for inventing new ideas, trying them, conducting experiments and obtaining meaningful results and takes more few weeks to write the paper itself in a way that makes the reviewers happy. Publishing 7 papers means 5-6 months of active work which I did for 3 years mostly in the weekends. Writing the PhD Thesis itself takes additionally a month of full time work. Thus compared to a typical Master’s Thesis graduating successfully a PhD degree takes 5-10 times more effort than to write a Master’s Thesis.

This was my experience. I am sure that some people graduate successfully with less effort but I could not afford myself doing low-quality work. I am just a person who works hard and with high-quality.

If I knew how much effort this PhD degree would require I would probably not start it.

Downloads

If you are interested in my research area, please feel free to download the presentation of my PhD thesis: Nakov-PhD-Thesis-False-Friends-Presentation.ppt (PowerPoint presentation, in Bulgarian).

Also download the extended resume of my work (abstract): Nakov-PhD-Avtoreferat-False-Friends.pdf (PDF file, in Bulgarian).

Comments (7)

7 Responses to “Svetlin Nakov was Awarded with a PhD Degree in Informatics”

  1. A wise guy says:

    Congratulations for your achievement!

    However, I would kindly like to disagree with parts of your words, taken into a generic context (i.e. not wrt you in particular). Scientific effort&results are extremely hard to measure, almost impossible. The time spent for a given task, is quite irrelevant for the quality of the results obtained. No obvious relation could be established between those two concepts. Next, in the world we have thousands of universities, from most elite ones, to almost rural institutions, known to a few people. Quality of the work, that is sufficient for earning a given scientific degree, is completely different for the different institutions.

    Nowadays, there is almost no real science. Most of the universities aim at producing numberless amounts of papers, research, etc. All in all, there have to be PhDS, and professors, right ? Science is a money machine, and a way to earn prestige, nothing else. But this prestige will only matter to those that are still blind, and cannot realize the truth that everyone could become PhD. From Kadaffi’s son (I am sure that he is a genius), to Ivan Slavkov, who is very far ahead of you – he is academician! Sooner or later, people will become to understand that science is a modern sophisticated way to find a comfortable way in life, and some perstige, by doing … a Hi-Fi bullshit, and meaningless work, which could be detected as such only by smart people, and not my the masses. (pity for the huge amount of paper wasted for printing all those articles/papers).

    And last, but not least, I know from a trustworthy resource, that you did cheat on some math exams during your bachelor degree. Not saying that you cannot pass them with excellence with your own brains (well proven by all your huge successes in CS olympiads, that no one can deny)… but truth is: you cheated.

  2. nakov says:

    Wise guy, wise said!

    Yes, I agree: the quality of the work, that is sufficient for earning a given scientific degree, is completely different for the different institutions. The description above was my particular experience shared with the community. I know few people earned PhD degree with less effort. Yes this could be achieved. The effort and time spent does not measure the quality and results. That’t why I work as software engineer and trainer and not as scientist.

    I also agree that the PhD degree is more about earning a prestige than contributing to the science. Most PhD holders (like me) have just completed the PhD requirements and are not distinguished scientists. I described the Bulgarian requirements that officially drive the scientific community (at the time I got my PhD).

    An finally, I don’t like math and math exams. The good news are that being an exceptional software engineering professional and being successful in the real life or even being a scientist does not require exceptional math skills.

  3. Georgi says:

    Човече,случайно попаднах на сайта ти, но за мен ти си велик Българин,евала ти правя за твоите идеи и мечти,такива хора като теб трябват,успех във всички начинания!

  4. amaon.zom says:

    Hello! Quick question that’s totally off topic. Do you know how to
    make your site mobile friendly? My blog looks weird when browsing from my iphone 4.

    I’m trying to find a template or plugin that might be able to correct this issue.
    If you have any suggestions, please share. Appreciate it!

  5. Helklo my friend! I want to say that this post is awesome, nice written and come with approximarely all important infos.
    I would like to seee extra posts like this
    .

    my blog password protect

  6. albanian guy says:

    – please update your book (C#) or tell people not to use it because maybe its outdated (from 2013).
    – Change the cover of the book because the shark looks dangerous

  7. The information is very special, I will have to follow you.

RSS feed for comments on this post. TrackBack URL

LEAVE A COMMENT