Big Data Software Engineering: Analysis of Knowledge Domains and Skill Sets Using LDA-Based Topic Modeling

dc.authoridGURCAN, Fatih/0000-0001-9915-6686
dc.authoridCagiltay, Nergiz/0000-0003-0875-9276
dc.authorscopusid57194776706
dc.authorscopusid16237826800
dc.authorwosidGURCAN, Fatih/AAJ-7503-2021
dc.authorwosidCagiltay, Nergiz/O-3082-2019
dc.contributor.authorÇağıltay, Nergiz
dc.contributor.authorCagiltay, Nergiz Ercil
dc.contributor.otherSoftware Engineering
dc.date.accessioned2024-07-05T15:28:34Z
dc.date.available2024-07-05T15:28:34Z
dc.date.issued2019
dc.departmentAtılım Universityen_US
dc.department-temp[Gurcan, Fatih] Karadeniz Tech Univ, Fac Engn, Dept Comp Engn, TR-61080 Trabzon, Turkey; [Cagiltay, Nergiz Ercil] Atilim Univ, Fac Engn, Dept Software Engn, TR-06830 Ankara, Turkeyen_US
dc.descriptionGURCAN, Fatih/0000-0001-9915-6686; Cagiltay, Nergiz/0000-0003-0875-9276en_US
dc.description.abstractSoftware engineering is a data-driven discipline and an integral part of data science. The introduction of big data systems has led to a great transformation in the architecture, methodologies, knowledge domains, and skills related to software engineering. Accordingly, education programs are now required to adapt themselves to up-to-date developments by first identifying the competencies concerning big data software engineering to meet the industrial needs and follow the latest trends. This paper aims to reveal the knowledge domains and skill sets required for big data software engineering and develop a taxonomy by mapping these competencies. A semi-automatic methodology is proposed for the semantic analysis of the textual contents of online job advertisements related to big data software engineering. This methodology uses the latent Dirichlet allocation (LDA), a probabilistic topic-modeling technique to discover the hidden semantic structures from a given textual corpus. The output of this paper is a systematic competency map comprising the essential knowledge domains, skills, and tools for big data software engineering. The findings of this paper are expected to help evaluate and improve IT professionals' vocational knowledge and skills, identify professional roles and competencies in personnel recruitment processes of companies, and meet the skill requirements of the industry through software engineering education programs. Additionally, the proposed model can be extended to blogs, social networks, forums, and other online communities to allow automatic identification of emerging trends and generate contextual tags.en_US
dc.identifier.citation61
dc.identifier.doi10.1109/ACCESS.2019.2924075
dc.identifier.endpage82552en_US
dc.identifier.issn2169-3536
dc.identifier.scopus2-s2.0-85068646074
dc.identifier.scopusqualityQ1
dc.identifier.startpage82541en_US
dc.identifier.urihttps://doi.org/10.1109/ACCESS.2019.2924075
dc.identifier.urihttps://hdl.handle.net/20.500.14411/2819
dc.identifier.volume7en_US
dc.identifier.wosWOS:000475354200001
dc.identifier.wosqualityQ2
dc.language.isoenen_US
dc.publisherIeee-inst Electrical Electronics Engineers incen_US
dc.relation.publicationcategoryMakale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanıen_US
dc.rightsinfo:eu-repo/semantics/openAccessen_US
dc.subjectBig data software engineeringen_US
dc.subjectcompetency mapen_US
dc.subjectknowledge domains and skill setsen_US
dc.subjecttopic modelingen_US
dc.subjectlatent Dirichlet allocationen_US
dc.titleBig Data Software Engineering: Analysis of Knowledge Domains and Skill Sets Using LDA-Based Topic Modelingen_US
dc.typeArticleen_US
dspace.entity.typePublication
relation.isAuthorOfPublicationc99221fa-e0c9-4b73-9f64-54919fcd3c58
relation.isAuthorOfPublication.latestForDiscoveryc99221fa-e0c9-4b73-9f64-54919fcd3c58
relation.isOrgUnitOfPublicationd86bbe4b-0f69-4303-a6de-c7ec0c515da5
relation.isOrgUnitOfPublication.latestForDiscoveryd86bbe4b-0f69-4303-a6de-c7ec0c515da5

Files

Collections