About "academia" on monperrus.net

by Martin Monperrus
April 21, 2018: How to make a good open-science repository?
If you're doing some kind of computational or data-intensive research, this post describes how to make a good open-science repository about your digital experimental results. It summarizes my experience in doing such replication packages over years for my software engineering experiments. >>> read more
March 23, 2018: The Multiple Goals and Data in Data-Mining for Software Engineering
Data mining for software engineering consists of collecting software engineering data, extracting some knowledge from it and, if possible, use this knowledge to improve the software engineering process, in other words “operationalize” the mined knowledge. >>> read more
October 20, 2017: Open-science and Double-blind Peer-Review
Recently, double-blind peer-review has fallen over my research community as a storm. Unfortunately, beyond its noble goal of reducing unfairness, double-blind peer-review may have detrimental collateral effects on open-science. >>> read more
May 30, 2017: How to use LuaTeX and LuaLaTeX on arXiv?
The arXiv open-access archive only supports pdftex/pdflatex. If you like using LuaLaTeX and all its wonderful features, it is very likely that you won't be able to compile it identically using pdflatex. >>> read more
May 22, 2017: Copy bibtex to clipboard in Google Scholar
I am addicted to Google Scholar. It has found so many excellent papers related to my research! However, there is one thing for which it sucks: Bibtex export. >>> read more
December 05, 2016: How to receive email notifications about new Arxiv papers?
How to receive notifications about new Arxiv papers in software engineering and programming languages? Email You have to write an email to cs@arxiv.org, with subject subscribe your name and email body: add Programming Languages add Software Engineering The official (and unclear) doc is https://arxiv. >>> read more
November 09, 2016: Cleaning paper sources for open-access websites such as Arxiv
Some open-access archive websites require to upload the sources of the paper (eg Latex sources). However, the sources may contain comments that are either private or not appropriate for public viewing and eternity archival. >>> read more
September 01, 2016: Makitoo: la réparation à chaud du labo à la startup
Makitoo est une startup issue des travaux de recherche en génie logiciel menés au laboratoire CRISTAL et Inria Lille, dans l'équipe Spirals. Créée en mars 2016, Makitoo propose des solutions de monitoring logiciel couplées avec la réparation à chaud, le tout en production. >>> read more
August 11, 2016: The tasks of a publication chair
What's the duties of a publication chair in a conference? Here are some humble notes from my small experience and the pieces of advice of Francesca Lonetti. >>> read more
August 01, 2016: Interesting pointers about open software science
awesome-open-science-software lists resources about open science and software: software as main experimental tool (aka computational methods) software as main study object (software engineering, programming language, systems software, ...). >>> read more
June 02, 2016: Python script to create an RSS feed from DBLP
Creates an RSS feed with DBLP JSON data from its API. #!/usr/bin/python2.7 """ Creates an RSS feed with DBLP JSON data from its API. Meant to be used as CGI script but can also be used as command-line. Feedback and pull request welcome. >>> read more
April 23, 2016: Open-science and Travis: continuous integration for reproducible scientific experiments
BLUF: one can use the concepts from continuous integration for achieving nearly perfectly reproducible computational experiments. A continuous integration server is a server that compiles and runs automated tests for each commit of a software project. >>> read more
February 13, 2016: How to write a good author response?
Peer-review has many different forms. One classical form is the journal publication process, where authors submit articles, reviewers comment on it, and the authors are invited to submit a revised version based on the comments. >>> read more
January 27, 2016: Lab notes on compiling genprog
Here are some notes on the Genprog automatic repair tool. Note that Genprog is now on Github: https://github.com/squaresLab/genprog-code Compiling version SVN 1688 genprog-code-snapshot-svn1688.tar.gz First compile CIL wget http://downloads.sourceforge.net/project/cil/cil/cil-1.7.3.tar. >>> read more
January 15, 2016: Required qualifications for research
Here are the key qualifications for successful research. be really excited about your project be ok working hard be persistent (get back up and continue when things don't work out as planned because true research rarely works out as planned) be fearless (e.g. >>> read more
November 24, 2015: Open-science and code sharing
In my software engineering research, I'm often curious about the prototypes and experiments of others, I want to know more, beyond what's written in the paper, I want to try and experiment myself. >>> read more
November 23, 2015: Introduction to Antifragile Software by M. Monperrus
Antifragile software in a nutshell Antifragile software engineering is the art of building ultra dependable software systems running in ultra open and unpredictable environments. >>> read more
October 16, 2015: Allopatric software diversification
In ecology, allopatric diversification is when the same population is split in two different locations and diversification happens after the split. In this post, we present the concept of ``allopatric software diversification'', directly inspired from the ecological concept. >>> read more
September 01, 2015: Martin's Collection of Bug Stories
I work on automatic repair of software bugs. I love good bugs stories. Classical (catastrophic) These are even meme. >>> read more
June 27, 2015: Un flux RSS pour les postes de Galaxie
En France, les postes académiques (Maître de conférence, Professeur) des Universités dépendantes du Ministère de l'Enseignement Supérieur sont publiés sur un site nommé Galaxie. >>> read more
June 23, 2015: A Monte-Carlo study of Spearman correlation coefficient
This is small Python program to study Spearman correlation coefficient in order to find/validate the critical values. >>> read more
May 07, 2015: The "Revise and resubmit as new" option
Recently, as author and reviewer, I discovered that some academic journals use a new kind of notification "Revise and resubmit as new". >>> read more
May 02, 2015: Graphical Understanding of Cohen's d Effect Size
I've just read "A systematic review of effect size in software engineering experiments" (IST, 2007). The authors say that in software engineering, an effect size is considered as: small if Cohen's d is approx. 0.17 medium if Cohen's d is approx. 0.60 large if Cohen's d is approx. 1. >>> read more
April 12, 2015: RSS feeds for software engineering jobs in academia
IEEE job site: source, software engineering, software engineering professorships Euraxess: source, software engineering, software engineering professorships Eurosys: source, professorships only Computeroxy: http://computeroxy.com/ ** Job web site ACM**: https://jobs.acm. >>> read more
October 15, 2014: A Discussion on Antifragile Software
Questions by Russ Miller and Bett Correa. Answers by Martin Monperrus, based on the paper "Principles of Antifragile Software" Please explain what is meant by Antifragile by Taleb?Antifragile by Taleb is a radically new perspective on errors. >>> read more
June 26, 2014: Automatic software repair
I've been working on automatic software repair. I maintain this curated list of pointers for helping newcomers understand the field. >>> read more
May 08, 2014: Software Engineering DBLP Search
Here is a Javacript interface for searching papers in DBLP with easy filters on software engineering (or related) conferences and journals. >>> read more
May 07, 2014: Notes on program synthesis
Here are some notes on program synthesis. >>> read more
March 28, 2014: Empirical Software Engineering Research: Methodology Papers
Here are important papers about empirical research in software engineering. >>> read more
March 26, 2014: Empirical studies on bugs
I love bugs. I work on automatic software repair. Hence, I am constantly looking for empirical knowledge on bugs. Here are the main papers on this topic. I you know more of them, please drop me an email. >>> read more
January 03, 2014: What is self-healing software?
What is self-healing software? "Self-healing software provides automatic responses to faults of a given fault model in a certain design context" [4]. I recommend Koopman's paper [4] and Keromytis' paper [3] for an excellent overview of the field. >>> read more
November 12, 2013: How to write a good scientific review
A review is a critical evaluation of a paper. In a usual peer-review process, a review is meant to be read: by the editor (or the PC chair) and by the authors. The editor/chair uses it to accept/reject the paper, the authors get a wealth of information to improve their paper. >>> read more
June 20, 2013: Companion Web Page for "Empirical Evidence of Large-Scale Diversity in API Usage of Object-Oriented Software"
Raw Dataset: Content: 3 418 Jar files, which include 382 774 different types (classes or interfaces). Source: All Jar files present on a machine used for performing software mining experiments for 7 years. URL: http://ecologyse.gforge.inria.fr/dataset-jars-scam2013.zip (1. >>> read more
April 16, 2013: Six Golden Rules for Writing, Using and Sharing Research Prototypes
A research prototype demonstrates an idea, collects some data for sake of writing a scientific paper. When it's software, it must be considered as a special kind of software. >>> read more
December 31, 2012: The 1000-index: a bibliometric for outstanding scholars
What is the 1000-index? This post proposes a new bibliographic metric, defined as follows: The 1000-index is the number of papers cited at least a thousand times. I use Google Scholar as citation database to compute it. >>> read more
November 25, 2012: Publication lists with HAL and bibtexbrowser
[[http://hal.archives-ouvertes.fr/?langue=en|Hal]] is an open-access archive, similar to [[http://arxiv.org/|Arxiv]]. It is funded and maintained by French institutions such as [[http://www.cnrs.fr/|CNRS]] and [[http://www.inria.fr|INRIA]]. >>> read more
October 31, 2012: How to Do Good Scientific Research?
Quotes "The great thing about the research world is that you get to choose your environment, which consists in large measure of the members of your network. >>> read more
May 25, 2012: Companion Web Page for "Abmash: Mashing Up Legacy Web Applications by Automated Imitation of Human Actions"
Demo Code and Documentation See http://github. >>> read more
April 11, 2012: Pointers on abstract syntax tree differencing algorithms and tools
This post presents papers and tools on semantic source code differencing. It is a special kind of tree differencing. Semantic source code diff Unix diff and successors (CVS, GIT diff) are line-based. On the contrary, semantic source code diff work on the abstract syntax tree (AST) [1,2,3,4,5,6,11]. >>> read more
March 26, 2012: A Comparison of Median and Mean
Mean and median are two measures to summarize a data set of N numerical values. However, there are not equivalent. Here is a thorough comparison of their properties. >>> read more
February 13, 2012: Sharing scientific datasets with BitTorrent
Bittorrent is an appropriate technology to share scientific datasets because it relieves web servers of distributing many copies of large files and enables a kind of replication. This documents presents how to share scientific datasets with bitorrents. First, you have to set a tracker and a seeder. >>> read more
February 09, 2012: Sciclomatic: A Peer-to-Peer System for Sharing Scientific Datasets
This post sketches a peer-to-peer system for sharing scientific datasets. Permalink: http://www.monperrus.net/martin/sciclomatic-sharing-scientific-datasets Introduction Academics, students and researchers obtain, create or use data in their experiments. >>> read more
February 09, 2012: Resources for the CVS-Vintage Dataset
CVS-Vintage is a dataset of 14 CVS repositories of Java applications. Paper CVS-Vintage: A Dataset of 14 CVS Repositories of Java Software Dataset Light version -- only Java files: http://hal.archives-ouvertes.fr/docs/00/76/91/21/ANNEX/cvs-vintage-light-v20120509.tar. >>> read more
October 14, 2011: Companion Web Page for "Detecting Missing Method Calls As Violations of the Majority Rule"
This page is the companion web page for "[[http://www.monperrus.net/martin/bibtexbrowser.php?key=Monperrus2011b&bib=monperrus.bib|Detecting Missing Method Calls As Violations of the Majority Rule]]" (Martin Monperrus and Mira Mezini). >>> read more
October 12, 2011: Companion Web Page for "What Should Developers Be Aware Of? An Empirical Study on the Directives of API Documentation"
This page is the companion web page for //"What Should Developers Be Aware Of? An Empirical Study on the Directives of API Documentation"// (Martin Monperrus, Michael Eichberg, Elif Tekes, Mira Mezini). [[dataset-src.tar. >>> read more
December 17, 2010: RSS feeds for software engineering publications
Here are the main RSS feeds for academic publications related to software engineering. The inclusion in the feed depends on the publisher reputation (currently the major academic publishers) and on the ISI Web of Knowledge bibliometrics. >>> read more
December 12, 2010: Graphical Visualization for Understanding the f1-score
F1 is a standard evaluation metric from information retrieval research. It combines the precision and the recall. In order to understand this combination, here is a visualization of the landscape of the F1-score. >>> read more
December 09, 2010: Three statistical formulas for estimating a proportion
This document presents three statistical formulas that give the margin of errors when estimating a proportion, and a piece of code to empirically verifies the formulas. It uses an unusual programming manner for discussing statistics (in contrast to pure maths). >>> read more
October 27, 2009: Academic publishers and bibliographic metadata
Here is a survey on the use of embedded bibliographic metadata by the main academic publishers. The data was collected on 2009-10-24. Please tell me if something has changed. >>> read more
October 27, 2009: Accurate bibliographic metadata and google scholar
Disclaimer: the information in this post is a guess and does not work deterministically. It seems that the [[http://scholar.google.com/intl/en/scholar/inclusion. >>> read more
July 14, 2009: Creating proceedings from pdf files with latex
As publication chair of MDPLE'2009, I had to create the proceedings of the workshop. >>> read more
Share on Twitter