Megaputer Blog

Creating a pre-populated dictionary directly from the results of an analysis

22.08.2013, Yuri Slynko

PolyAnalyst 6.5.1402 introduces a convenient new feature for creating custom dictionaries. When analyzing text, you can use dictionaries to tune your analytic results, for example by excluding keywords with a stop list or by grouping keywords as synonyms with a thesaurus-like dictionary. PolyAnalyst ships with a variety of default dictionaries to kick-start your analytical projects. Sometimes the default dictionaries are enough; when they are not, you are faced with the task of creating a new dictionary. This often happens because of the domain of your analytical task: to really gain insight from the data, you need to teach PolyAnalyst the jargon that the speakers in the data are using. Just as when visiting a foreign country, you can get around town much more easily when you speak the local language. The traditional way of doing this…
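To make the two dictionary roles mentioned above concrete, here is a minimal Python sketch of a stop list and a synonym dictionary applied to keyword counting. It is an illustration only, not PolyAnalyst's implementation or API; the names `STOP_LIST`, `SYNONYMS`, and `keyword_counts` are hypothetical.

```python
import re
from collections import Counter

# Hypothetical stop list: words to exclude from the keyword results.
STOP_LIST = {"the", "a", "is", "and", "to"}

# Hypothetical thesaurus-like dictionary: variant -> canonical keyword.
SYNONYMS = {"automobile": "car", "vehicle": "car"}


def keyword_counts(text, stop_list=STOP_LIST, synonyms=SYNONYMS):
    """Count keywords, dropping stop words and merging synonyms."""
    tokens = re.findall(r"[a-z]+", text.lower())
    kept = (synonyms.get(tok, tok) for tok in tokens if tok not in stop_list)
    return Counter(kept)


counts = keyword_counts("The car is fast and the automobile is loud")
print(counts)
```

In this sketch "automobile" is counted under "car", and the stop words never reach the results. A domain-specific dictionary would extend both tables with the jargon of the data at hand.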


PolyAnalyst incorporates WordNet 3.0

06.04.2009, Yuri Slynko

For several years, PolyAnalyst has successfully used WordNet 1.6 to provide customers with a basic starter English dictionary of terms and relations. WordNet is a popular online dictionary developed and maintained by Princeton University. PolyAnalyst incorporates a select number of word properties from WordNet into its various natural language processing tools. For example, PolyAnalyst uses WordNet to flesh out lists of hypernyms, hyponyms, meronyms, and antonyms. PolyAnalyst 6.0.950 introduces support for WordNet 3.0. After upgrading, you will notice the new dictionary in the dictionaries list displayed by the Manage Dictionaries window. All text processing operations you perform after the upgrade will use the new dictionary version by default.

[Figure: an example of the new dictionaries window]

For more information on WordNet, check out http://wordnet.princeton.edu/. To learn more about PolyAnalyst, visit https://www.megaputer.com/polyanalyst.


PolyAnalyst 6 adds a simple way of grouping several boolean columns into a few categorical columns

06.04.2009, Yuri Slynko

Starting in PolyAnalyst 6.0.950, you have access to a new node named Categorize Binaries. The node provides a method for deriving a compact representation of hundreds of multiple-choice variables. It operates by converting a large set of binary variables into a small number of categorical variables. For example, suppose you have a dataset of sales transactions involving hundreds of products. Suppose the dataset is designed so that each product is represented in a separate column, where a product column is true if the product was present in a sales transaction. Let us take a popular product line such as beer brands (Pilzner, Carlsberg, Guinness, etc.).

Customer Id | Bought Pilzner? | Bought Carlsberg? | Bought Guinness?
1           | Yes             | No                | No
2           | No              | No                | Yes
3           | No              | Yes               | No

For most of the transactions only one brand is selected. In other words, most of the binary product dataset attributes are…
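The idea of collapsing many binary columns into a few categorical ones can be sketched in Python as follows. This is a simplified illustration under stated assumptions, not the algorithm used by the Categorize Binaries node: the function name `categorize_binaries` and the output column names (`Brand 1`, `Brand 2`, ...) are hypothetical, and the sketch simply creates as many categorical columns as the largest number of brands selected in any one row.

```python
def categorize_binaries(rows, binary_columns, prefix="Brand"):
    """Replace binary columns with a few categorical columns.

    Each row gets the names of its true binary columns, in column order,
    padded with None so every row has the same number of new columns.
    """
    selections = [[col for col in binary_columns if row.get(col)] for row in rows]
    width = max((len(chosen) for chosen in selections), default=0)

    result = []
    for row, chosen in zip(rows, selections):
        # Keep non-binary columns (such as the customer id) unchanged.
        out = {key: val for key, val in row.items() if key not in binary_columns}
        padded = chosen + [None] * (width - len(chosen))
        for i, value in enumerate(padded, start=1):
            out[f"{prefix} {i}"] = value
        result.append(out)
    return result


# The three transactions from the example table.
transactions = [
    {"Customer Id": 1, "Pilzner": True, "Carlsberg": False, "Guinness": False},
    {"Customer Id": 2, "Pilzner": False, "Carlsberg": False, "Guinness": True},
    {"Customer Id": 3, "Pilzner": False, "Carlsberg": True, "Guinness": False},
]

compact = categorize_binaries(transactions, ["Pilzner", "Carlsberg", "Guinness"])
for row in compact:
    print(row)
```

Because each customer in the example bought exactly one brand, the three binary columns collapse into a single categorical column. A row with two brands selected would widen the output to two categorical columns.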





©2000-2021 Megaputer Intelligence LLC.