ACKNOWLEDGEMENT.md

This model includes derived data/code from the following awesome open source copyrighted material:

Wikidata

Wikidata is a free and open knowledge base that can be read and edited by both humans and machines. Wikidata acts as central storage for the structured data of its Wikimedia sister projects including Wikipedia, Wikivoyage, Wiktionary, Wikisource, and others.

The content of Wikidata is available under the CC0 1.0 Universal (CC0 1.0) Public Domain Dedication, is exported using standard formats, and can be interlinked to other open data sets on the linked data web.

License Summary

The person who associated a work with this deed has dedicated the work to the public domain by waiving all of his or her rights to the work worldwide under copyright law, including all related and neighboring rights, to the extent allowed by law.

You can copy, modify, distribute and perform the work, even for commercial purposes, all without asking permission.

Reference:

  1. https://www.wikidata.org/
  2. https://creativecommons.org/publicdomain/zero/1.0/

WordNet 3.0

WordNet® is a large lexical database of English. Nouns, verbs, adjectives and adverbs are grouped into sets of cognitive synonyms (synsets), each expressing a distinct concept. Synsets are interlinked by means of conceptual-semantic and lexical relations. The resulting network of meaningfully related words and concepts can be navigated with the browser. WordNet is also freely and publicly available for download. WordNet's structure makes it a useful tool for computational linguistics and natural language processing.

License Summary

WordNet Release 3.0

This software and database is being provided to you, the LICENSEE, by Princeton University under the following license. By obtaining, using and/or copying this software and database, you agree that you have read, understood, and will comply with these terms and conditions.:

Permission to use, copy, modify and distribute this software and database and its documentation for any purpose and without fee or royalty is hereby granted, provided that you agree to comply with the following copyright notice and statements, including the disclaimer, and that the same appear on ALL copies of the software, database and documentation, including modifications that you make for internal use or for distribution.

WordNet 3.0 Copyright 2006 by Princeton University. All rights reserved.

THIS SOFTWARE AND DATABASE IS PROVIDED "AS IS" AND PRINCETON UNIVERSITY MAKES NO REPRESENTATIONS OR WARRANTIES, EXPRESS OR IMPLIED. BY WAY OF EXAMPLE, BUT NOT LIMITATION, PRINCETON UNIVERSITY MAKES NO REPRESENTATIONS OR WARRANTIES OF MERCHANTABILITY OR FITNESS FOR ANY PARTICULAR PURPOSE OR THAT THE USE OF THE LICENSED SOFTWARE, DATABASE OR DOCUMENTATION WILL NOT INFRINGE ANY THIRD PARTY PATENTS, COPYRIGHTS, TRADEMARKS OR OTHER RIGHTS.

The name of Princeton University or Princeton may not be used in advertising or publicity pertaining to distribution of the software and/or database. Title to copyright in this software, database and any associated documentation shall at all times remain with Princeton University and LICENSEE agrees to preserve same.

Reference:

  1. https://wordnet.princeton.edu/
  2. https://wordnet.princeton.edu/license-and-commercial-use

fnTBL

fnTBL is a customizable, portable and free source machine-learning toolkit primarily oriented towards natural-language-related tasks (POS tagging, base NP chunking, text chunking, EOS detection, word sense disambiguation). It can handle reasonably sized, discrete classification tasks (i.e. tasks whose samples can be characterized as vectors with discrete components).

License Summary

Copyright (c) 2001 Johns Hopkins University and Radu Florian and Grace Ngai.

Permission is hereby granted, free of charge, to any person obtaining a copy of this software, fnTBL version 1.0, and associated documentation files (the "Software"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:

The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.

THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.

Reference:

  1. http://www.cs.jhu.edu/~rflorian/fntbl/
  2. https://www.cs.jhu.edu/~rflorian/fntbl/license.html

ML-SentiCon

This resource contains lemma-level sentiment lexicons for English, Spanish, Catalan, Basque and Galician. For each lemma, it provides an estimate of polarity (from very negative, -1.0, to very positive, +1.0) and a standard deviation (related to the ambiguity of the polarity estimate; please refer to the paper for further details).

These lexicons are layered, allowing a trade-off between the number of available words and the accuracy of the estimates. The lexicons have been automatically generated from an improved version of SentiWordNet, a very popular resource which contains estimates of the positivity and negativity of synsets. The resource containing all the lexicons, ML-SentiCon, is publicly available. © 2014 Sociedad Española para el Procesamiento del Lenguaje Natural.

Cruz, Fermín & Troyano, José & Pontes, Beatriz & Ortega, F. Javier. (2014). ML-SentiCon: A multilingual, lemma-level sentiment lexicon. Procesamiento de Lenguaje Natural. 53. 113-120.

Emoji Sentiment Ranking

A lexicon of 751 emoji characters with automatically assigned sentiment. The sentiment is computed from 70,000 tweets, labeled by 83 human annotators in 13 European languages. The process and analysis of emoji sentiment ranking is described in the paper: P. Kralj Novak, J. Smailović, B. Sluban, I. Mozetič, Sentiment of Emojis, submitted; arXiv preprint, 2015.

License Summary

You are free to:

  1. Share — copy and redistribute the material in any medium or format
  2. Adapt — remix, transform, and build upon the material for any purpose, even commercially.

The licensor cannot revoke these freedoms as long as you follow the license terms:

Attribution — You must give appropriate credit, provide a link to the license, and indicate if changes were made. You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use.

No additional restrictions — You may not apply legal terms or technological measures that legally restrict others from doing anything the license permits.

Reference:

  1. http://arxiv.org/abs/1509.07761
  2. https://figshare.com/articles/Emoji_Sentiment_Ranking/1600931/1
  3. https://creativecommons.org/licenses/by/4.0/

wink-lexicon

English lexicon useful in NLP/NLU. It is licensed under the terms of the MIT License.

License Summary

Copyright (c) 2017-19 GRAYPE Systems Private Limited

Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:

The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.

THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.

Reference:

  1. https://github.com/winkjs/wink-lexicon
  2. https://github.com/winkjs/wink-lexicon/blob/master/LICENSE