Sociograph of authors. Copyright Fernando de Assis Rodrigues. Conference Paper

Domain Analysis of scientific production about Data Collecting on…

The goal of this study is to identify scientific studies about the thematic of data collecting. For this purpose, it was adopted the domain analysis method on the scientific papers, by an application of Citation and Co-citation Analysis.

The identification of representatives from the thematic of data collecting and the existent dialog among them were obtained by authors and papers metadata sets processing, available on IEEE Xplore(r) Digital Library. As a search strategy, it was used on advanced search the terms ‘Data Collecting’, ‘Data Collect’, and ‘Data Gathering’, concatenated by the boolean operator ‘OR’. This process recovered 2,278 scientific papers and the sample was set only by scientific papers published in scientific journals between the years 1954 and 2018, with a total of 281 papers.

For each paper, the reference section was collected in the HTML document format. It was applied an algorithm to convert formats from HTML documents to CSV files and also to serialize the IEEE Editorial Style found on collected reference data. The algorithm processed 5,867 references and discarded 270 because they are not fit into the IEEE Editorial Style standards adopted on serialization.

From these references, was identified as a total of 8,267 authors. In Citation and Co-citation Analysis, it was applied the Price’s square root law to delimit the authors’ group to 91 participants, rounded to 94 participants because of the 91st participant had the same total of citation of his 3 successors.

After that, the “Cited and Who cited” and the “Absolute Frequency of Co-citation” matrices were generated from an application of an algorithm. By those data, the identification of nationality and the institutional affiliation were obtained by a manual process. Was calculated the social networks indexes i) Network Density, representing the relationship intensity between authors on the network and ii) Centrality Degree, representing the number of relationships received by an author.

The analyzed data resulted in a Network Density value of 3.20 with a standard deviation of 3.34, that is, each researcher has approximately 3 relationships with other network nodes. Also, the resulted value of Centrality Degree was 20.93%, demonstrating dispersion on the network, once that each node has 20.93% of probability to receive some interaction from the network.

This dispersion is associated with the analyzed domain amplitude, once that Data Collecting is a recurrent theme on distinct knowledge areas, but still adherent to IEEE scientific journals context.

When results of the Centrality Degree of each author are analyzed, it is possible to observe a relationship between the results of received citations, indicating that the 13 best-ranked authors by Centrality Degree are also the most cited ones, representing 25.16% of all citations from the network. Also in this group was identified an average of 7.69% from the total of cites to each author, with amplitude varying between 6.12% and 11.76%.

It was concluded that this thematic, although widely cited, shows an American core, related to the institutions UC, USC, and MIT.

Keywords: Data Collecting. Domain Analysis. IEEE.

Authors

  1. Fernando de Assis Rodrigues
  2. Fábio Mosso Moreira
  3. Ricardo César Gonçalves Sant’Ana

Full text available at

  1. X EIICA
  2. Research Gate
Conference Paper

Information and Technology: Thematic Course of IG 08

The thematic interest group course IG 08 was identified from the analysis of the papers presented in oral communication and poster, during the years 2008-2016. The qualitative and quantitative study took place from the domain analysis, to identify how the approach of technologies inside the IG 08 from seven categories of structured analysis from the menu of the IG and the proposal of Santos et al. (2013), identifying themes, authors, and institutions in each of them. This step of the study resulted in the mapping of the IG 08 – Information and Technology, with the display of the main approaches about the technologies in the context of IG 08, your distribution in terms of presentation and the rankings of authorship and of institutions.
Read more “Information and Technology: Thematic Course of IG 08”

Conference Paper

Contextualizing theoretical concepts on Online Social Networks data collecting…

The use of Online Social Network services raise concerns in the way information from individuals is shared, such as starting from the process of collecting data from users that are stored in the institutions that own this services. The purpose of this study is to establish a contextualization of the concepts involved in the data collection available at online social network services, based on the analysis of content in technical-operational documents and in Terms of Use, and by an exploration of the characteristics of the data collection interfaces. Read more “Contextualizing theoretical concepts on Online Social Networks data collecting process”

Conference Paper

Information and Technology in ENANCIB: Trends of GT #08

The scientific development established between areas of knowledge has as main characteristic the activities developed by the scientific community, which is set for the research activities and collective research groups, in an individually way. Analyzes are performed through indicators on science and technology that provide measures about some aspects identified in the scientific literature.

This study aims to map papers presented on the Working Group Information and Technology – GT 08 at ENANCIB, from years between 2008 and 2015. Read more “Information and Technology in ENANCIB: Trends of GT #08”