About "academia" on monperrus.net

by Martin Monperrus
#41 Open-science and Double-blind Peer-Review (October 20, 2017)
Recently, double-blind peer-review has fallen over my research community as a storm. Unfortunately, beyond its noble goal of reducing unfairness, double-blind peer-review may have detrimental collateral effects on open-science. >>> read more
#40 Copy bibtex to clipboard in Google Scholar (May 22, 2017)
I am addicted to Google Scholar. It has found so many excellent papers related to my research! However, there is one thing for which it sucks: Bibtex export. >>> read more
#39 How to receive email notifications about new Arxiv papers? (December 05, 2016)
How to receive notifications about new Arxiv papers in software engineering and programming languages? Email You have to write an email to cs@arxiv.org, with subject subscribe your name and email body: add Programming Languages add Software Engineering The official (and unclear) doc is https://arxiv. >>> read more
#38 Cleaning paper sources for open-access websites such as Arxiv (November 09, 2016)
Some open-access archive websites require to upload the sources of the paper (eg Latex sources). However, the sources may contain comments that are either private or not appropriate for public viewing and eternity archival. >>> read more
#37 Makitoo: la réparation à chaud du labo à la startup (September 01, 2016)
Présentation: Makitoo est une startup issue des travaux de recherche en génie logiciel menés au laboratoire CRISTAL et Inria Lille, dans l'équipe Spirals. Créée en mars 2016, Makitoo propose des solutions de monitoring logiciel couplées avec la réparation à chaud, le tout en production. >>> read more
#36 Interesting pointers about open software science (August 01, 2016)
awesome-open-science-software lists resources about open science and software: software as main experimental tool (aka computational methods) software as main study object (software engineering, programming language, systems software, ...). >>> read more
#35 Open-science and Travis: continuous integration for reproducible scientific experiments (April 23, 2016)
BLUF: one can use the concepts from continuous integration for achieving nearly perfectly reproducible computational experiments. A continuous integration server is a server that compiles and runs automated tests for each commit of a software project. >>> read more
#34 How to write a good author response? (February 13, 2016)
Peer-review has many different forms. One classical form is the journal publication process, where authors submit articles, reviewers comment on it, and the authors are invited to submit a revised version based on the comments. >>> read more
#33 Required qualifications for research (January 15, 2016)
Here are the key qualifications for successful research. be really excited about your project be ok working hard be persistent (get back up and continue when things don't work out as planned because true research rarely works out as planned) be fearless (e.g. >>> read more
#32 Open-science and code sharing (November 24, 2015)
In my software engineering research, I'm often curious about the prototypes and experiments of others, I want to know more, beyond what's written in the paper, I want to try and experiment myself. >>> read more
#31 Antifragile Software (November 23, 2015)
"How I Stopped Worrying and Learned to Love Software Errors" Antifragile software in a nutshell Antifragile software engineering is the art of building ultra dependable software systems running in ultra open and unpredictable environments. >>> read more
#30 Allopatric software diversification (October 16, 2015)
In ecology, allopatric diversification is when the same population is split in two different locations and diversification happens after the split. In this post, we present the concept of ``allopatric software diversification'', directly inspired from the ecological concept. >>> read more
#29 Un flux RSS pour les postes de Galaxie (June 27, 2015)
En France, les postes académiques (Maître de conférence, Professeur) des Universités dépendantes du Ministère de l'Enseignement Supérieur sont publiés sur un site nommé Galaxie. >>> read more
#28 A Monte-Carlo study of Spearman correlation coefficient (June 23, 2015)
This is small Python program to study Spearman correlation coefficient in order to find/validate the critical values. >>> read more
#27 The "Revise and resubmit as new" option (May 07, 2015)
Recently, as author and reviewer, I discovered that some academic journals use a new kind of notification "Revise and resubmit as new". >>> read more
#26 Graphical Understanding of Cohen's d Effect Size (May 02, 2015)
I've just read "A systematic review of effect size in software engineering experiments" (IST, 2007). The authors say that in software engineering, an effect size is considered as: small if Cohen's d is approx. 0.17 medium if Cohen's d is approx. 0.60 large if Cohen's d is approx. 1. >>> read more
#25 RSS feeds for software engineering jobs in academia (April 12, 2015)
No preview available >>> read more
#24 The MineAPI Project (December 05, 2014)
The project MineAPI has been accepted! MineAPI is about "Mining software repositories to instantiate software frameworks and react to API changes". It is funded within the Stic-Amsud initiative. >>> read more
#23 A Discussion on Antifragile Software (October 15, 2014)
Questions by Russ Miller and Bett Correa. Answers by Martin Monperrus, based on the paper "Principles of Antifragile Software" Please explain what is meant by Antifragile by Taleb?Antifragile by Taleb is a radically new perspective on errors. >>> read more
#22 Automatic software repair (June 26, 2014)
Awesome repair. I maintain this curated list of pointers about automatic software repair. Automatic software repair in a nutshell A tentative definition of the field: Automatic software repair is the process of fixing software bugs automatically. >>> read more
#21 Software Engineering DBLP Search (May 08, 2014)
Here is a Javacript interface for searching papers in DBLP with easy filters on software engineering (or related) conferences and journals. >>> read more
#20 Notes on program synthesis (May 07, 2014)
Here are some notes on program synthesis. >>> read more
#19 Empirical Software Engineering Research: Methodology Papers (March 28, 2014)
Here are some important papers about empirical research in software engineering. >>> read more
#18 Empirical studies on bugs (March 26, 2014)
I love bugs. I work on automatic software repair. Hence, I am constantly looking for empirical knowledge on bugs. Here are the main papers on this topic. I you know more of them, please drop me an email. >>> read more
#17 What is self-healing software? (January 03, 2014)
What is self-healing software? "Self-healing software provides automatic responses to faults of a given fault model in a certain design context" [4]. I recommend Koopman's paper [4] and Keromytis' paper [3] for an excellent overview of the field. >>> read more
#16 Data-mining for software engineering (December 23, 2013)
Data mining for software engineering consists of collecting software engineering data, extracting some knowledge from it and, if possible, use this knowledge to improve the software engineering process, in other words “operationalize” the mined knowledge. >>> read more
#15 6 Golden Rules for Writing, Using and Sharing Research Prototypes (April 16, 2013)
A research prototype demonstrates an idea, collects some data for sake of writing a scientific paper. When it's software, it must be considered as a special kind of software. >>> read more
#14 The 1000-index (December 31, 2012)
What is the 1000-index? This post proposes a new bibliographic metric, defined as follows: The 1000-index is the number of papers cited at least a thousand times. I use Google Scholar as citation database to compute it. >>> read more
#13 How to Do Good Scientific Research? (October 31, 2012)
Thinking of doing a Ph.D, a research internship? Already started one? Here are skills you learn to master in a research environment. There may be some bias towards science and esp. computer science. >>> read more
#12 Pointers on abstract syntax tree differencing algorithms and tools (April 11, 2012)
This post presents papers and tools on semantic source code differencing. It is a special kind of tree differencing. Semantic source code diff Unix diff and successors (CVS, GIT diff) are line-based. On the contrary, semantic source code diff work on the abstract syntax tree (AST) [1,2,3,4,5,6,11]. >>> read more
#11 A Comparison of Median and Mean (March 26, 2012)
Mean and median are two measures to summarize a data set of N numerical values. However, there are not equivalent. Here is a thorough comparison of their properties. >>> read more
#10 Sharing scientific datasets with BitTorrent (February 13, 2012)
Bittorrent is an appropriate technology to share scientific datasets because it relieves web servers of distributing many copies of large files and enables a kind of replication. This documents presents how to share scientific datasets with bitorrents. First, you have to set a tracker and a seeder. >>> read more
#9 Sciclomatic: A Peer-to-Peer System for Sharing Scientific Datasets (February 09, 2012)
This post sketches a peer-to-peer system for sharing scientific datasets. Permalink: http://www.monperrus.net/martin/sciclomatic-sharing-scientific-datasets Introduction Academics, students and researchers obtain, create or use data in their experiments. >>> read more
#8 Resources for the CVS-Vintage Dataset (February 09, 2012)
CVS-Vintage is a dataset of 14 CVS repositories of Java applications. URLs Light version -- only Java files (HTTP): http://hal.archives-ouvertes.fr/docs/00/76/91/21/ANNEX/cvs-vintage-light-v20120509.tar.bz2 (46MB) Full version https://zenodo. >>> read more
#7 Standup Reading Groups (January 16, 2012)
A reading group is a regular meeting in a research group or departement to discuss papers on a certain topic. A standup reading group is a kind of reading group that aims at being lively and easy to organize. Contrary to research seminars, they are not graded (but students are always welcome). >>> read more
#6 RSS feeds for software engineering publications (December 17, 2010)
Here are the main RSS feeds for academic publications related to software engineering. The inclusion in the feed depends on the publisher reputation (currently the major academic publishers) and on the ISI Web of Knowledge bibliometrics. >>> read more
#5 Understanding the f1-score (December 12, 2010)
F1 is a standard evaluation metric from information retrieval research. It combines the precision and the recall. In order to understand this combination, here is a visualization of the landscape of the [[http://en.wikipedia.org/wiki/F1_score|F1-score]]. >>> read more
#4 Three statistical formulas for estimating a proportion (December 09, 2010)
This document presents three statistical formulas that give the margin of errors when estimating a proportion, and a piece of code to empirically verifies the formulas. It uses an unusual programming manner for discussing statistics (in contrast to pure maths). >>> read more
#3 Academic publishers and bibliographic metadata (October 27, 2009)
Here is a survey on the use of embedded bibliographic metadata by the main academic publishers. The data was collected on 2009-10-24. Please tell me if something has changed. >>> read more
#2 Accurate bibliographic metadata and google scholar (October 27, 2009)
Disclaimer: the information in this post is a guess and does not work deterministically. It seems that the [[http://scholar.google.com/intl/en/scholar/inclusion. >>> read more
#1 Creating proceedings from pdf files with latex (July 14, 2009)
As publication chair of MDPLE'2009, I had to create the proceedings of the workshop. >>> read more