SEUIR Repository

Github application programme interface and wordnet for code reuse


dc.contributor.author Pirapuraj
dc.contributor.author Perera, Indika
dc.date.accessioned 2016-12-30T09:49:02Z
dc.date.available 2016-12-30T09:49:02Z
dc.date.issued 2016-12-29
dc.identifier.citation Proceedings of Fifth Annual Science Research Sessions 2016 on "Enriching the Novel Scientific Research for the Development of the Nation" pp.7-16 en_US
dc.identifier.isbn 9.78956E+12
dc.identifier.uri http://ir.lib.seu.ac.lk/handle/123456789/2050
dc.description.abstract Code reuse is an important task in software development and maintenance. Because a large amount of software and source code is kept as libraries in version control systems such as Git, SVN, and LibreSource, and on related web sites such as GitHub.com, sourceforge.net, projectsgeek.com, and Googlecode.com, more and more companies, especially Small and Medium Enterprises (SMEs), are reusing open source code to develop their own software. The problem in code reuse is that, after downloading all relevant code, we need to identify the most relevant code among the pool of code. In this paper we use keyword search with the n-gram NLP technique through the GitHub Application Program Interface (API). Before searching the source code, we use the GitHub API to retrieve all repository names in GitHub that belong to a particular programming language (Java, C++, etc.), as well as all .java file names when searching for Java libraries. We then compare our keyword with this list; if the keyword extracted from the software architecture is a connected word, we split it using the Apache Camel Splitter. If the keyword is related to any project, we download that project. Otherwise, we obtain synonyms from WordNet and repeat the above process. For further relevancy, we will use a speech recognition technique (Dynamic Time Warping (DTW)) and an NLP technique (Part of Speech Tagging (POS)). Because this is one part of a larger research effort, this paper considers only the GitHub API. en_US
dc.language.iso en_US en_US
dc.publisher Faculty of Applied Sciences, South Eastern University of Sri Lanka en_US
dc.subject n-gram en_US
dc.subject GitHub API en_US
dc.subject WordNet en_US
dc.subject Dynamic Time Warping en_US
dc.subject Part of Speech Tagging en_US
dc.subject Apache Camel Splitter en_US
dc.title Github application programme interface and wordnet for code reuse en_US
dc.type Article en_US
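
The search procedure summarised in the abstract (split a connected keyword, compare it with repository names using n-grams, and fall back to synonyms when nothing matches) can be illustrated with a minimal sketch. This is not the authors' implementation, and several parts are assumptions made only for illustration: the repository names are a hard-coded list rather than results from the GitHub API, a regular expression stands in for the Apache Camel Splitter, a small hypothetical dictionary stands in for WordNet, and character trigrams with Jaccard similarity and a 0.5 threshold are one possible reading of "n-gram" matching. All function names are hypothetical.

```python
import re

def split_compound(keyword):
    """Split a connected keyword (camelCase, snake_case, kebab-case) into lowercase words.
    Stand-in for the Apache Camel Splitter step; an assumption, not the paper's code."""
    parts = re.findall(r"[A-Z]+(?![a-z])|[A-Z]?[a-z]+|\d+", keyword)
    return [p.lower() for p in parts]

def char_ngrams(text, n=3):
    """Return the set of character n-grams of text, padded so short words still yield n-grams."""
    pad = "_" * (n - 1)
    padded = f"{pad}{text.lower()}{pad}"
    return {padded[i:i + n] for i in range(len(padded) - n + 1)}

def ngram_similarity(a, b, n=3):
    """Jaccard similarity between the character n-gram sets of two words."""
    ga, gb = char_ngrams(a, n), char_ngrams(b, n)
    if not ga or not gb:
        return 0.0
    return len(ga & gb) / len(ga | gb)

def find_matches(keyword, repo_names, threshold=0.5):
    """Return repository names in which some word is n-gram-similar to some keyword word."""
    words = split_compound(keyword)
    matches = []
    for name in repo_names:
        name_words = split_compound(name)
        score = max(
            (ngram_similarity(w, nw) for w in words for nw in name_words),
            default=0.0,
        )
        if score >= threshold:
            matches.append(name)
    return matches

def search_with_fallback(keyword, repo_names, synonyms, threshold=0.5):
    """Try the keyword first; if nothing matches, retry with each synonym in turn.
    `synonyms` is a plain dict standing in for a WordNet lookup (assumption)."""
    matches = find_matches(keyword, repo_names, threshold)
    if matches:
        return matches
    for word in split_compound(keyword):
        for synonym in synonyms.get(word, []):
            matches = find_matches(synonym, repo_names, threshold)
            if matches:
                return matches
    return []

# Hypothetical data: in the described system these names would come from the GitHub API.
REPO_NAMES = ["image-resizer", "FileUploader", "json-parser"]
SYNONYMS = {"picture": ["image"]}

direct_hit = search_with_fallback("fileUpload", REPO_NAMES, SYNONYMS)
synonym_hit = search_with_fallback("picture", REPO_NAMES, SYNONYMS)
```

In this sketch "fileUpload" is split into "file" and "upload" and matches "FileUploader" directly, while "picture" matches nothing until its stand-in synonym "image" is tried. A real pipeline would replace the hard-coded list and dictionary with GitHub API results and a WordNet lookup.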

