Analyzing Code Comments to Boost Program Comprehension

05/06/2019
by   Yusuke Shinyama, et al.
0

We are trying to find source code comments that help programmers understand a nontrivial part of source code. One of such examples would be explaining to assign a zero as a way to "clear" a buffer. Such comments are invaluable to programmers and identifying them correctly would be of great help. Toward this goal, we developed a method to discover explanatory code comments in a source code. We first propose eleven distinct categories of code comments. We then developed a decision-tree based classifier that can identify explanatory comments with 60 that are written in two languages: Java and Python. This task is novel in that it focuses on a microscopic comment ("local comment") within a method or function, in contrast to the prior efforts that focused on API- or method-level comments. We also investigated how different category of comments is used in different projects. Our key finding is that there are two dominant types of comments: preconditional and postconditional. Our findings also suggest that many English code comments have a certain grammatical structure that are consistent across different projects.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/23/2022

Preprocessing Source Code Comments for Linguistic Models

Comments are an important part of the source code and are a primary sour...
research
03/02/2023

Performance Comparison of Binary Machine Learning Classifiers in Identifying Code Comment Types: An Exploratory Study

Code comments are vital to source code as they help developers with prog...
research
06/25/2020

Source Code Comments: Overlooked in the Realm of Code Clone Detection

Reusing code can produce duplicate or near-duplicate code clones in code...
research
09/22/2017

Code Attention: Translating Code to Comments by Exploiting Domain Features

Appropriate comments of code snippets provide insight for code functiona...
research
07/10/2019

Identifying Algorithm Names in Code Comments

For recent machine-learning-based tasks like API sequence generation, co...
research
03/18/2022

How Do Programmers Express High-Level Concepts using Primitive Data Types?

We investigated how programmers express high-level concepts such as path...
research
08/25/2021

RepliComment: Identifying Clones in Code Comments

Code comments are the primary means to document implementation and facil...

Please sign up or login with your details

Forgot password? Click here to reset