About "academia" on monperrus.net

by Martin Monperrus
November 10, 2018: DBLP search in software engineering journals and conferences
Search in title/author: Quick checks Journals: Conferences: Based on http://github.com/monperrus/academia.json/ $('#input').keypress(function (e) { if (e. >>> read more
November 02, 2018: Open-access Bibliographic APIs
Hopefully useful notes about web APIs to access open-access bibliographic data. Meta API Meta-APIs gather content from different bibliographic repositories (for example 1000+ for BASE).  BASE Documentation: https://api.base-search.net/ Indexed repositories: https://www.base-search. >>> read more
August 30, 2018: Computational Research with GPUs in Sweden
If you need to perform computational experiments based on GPUs in Sweden, here is a list of pointers: Academia SNIC: The "Kebnekaise" resource at HPC2N contains some GPUs. Go to https://supr.snic. >>> read more
July 03, 2018: How to do a good master's thesis at KTH?
Novelty, Soundness, Clarity The goal of a master's thesis is to contribute to science with piece of knowledge that is: novel: nobody has never done this ever, anywhere in the world. >>> read more
May 18, 2018: Keeping arXiv's Light in the Double-blind Obscurity
Over the recent decades, arXiv has created a culture of open-access and early dissemination that has never existed at this scale in science. At the same time, in some disciplines, the peer-review process has become double-blind. >>> read more
May 04, 2018: How to publish a working paper on Arxiv?
It is a very good practice to publish working papers on Arxiv. Here are some tips on how to do this. >>> read more
April 21, 2018: How to make a good open-science repository?
If you're doing some kind of computational or data-intensive research, this post describes how to make a good open-science repository about your digital experimental results. It summarizes my experience in doing such replication packages over years for my software engineering experiments. >>> read more
March 23, 2018: The Multiple Goals and Data in Data-Mining for Software Engineering
Data mining for software engineering consists of collecting software engineering data, extracting some knowledge from it and, if possible, use this knowledge to improve the software engineering process, in other words “operationalize” the mined knowledge. >>> read more
October 20, 2017: Open-science and Double-blind Peer-Review
Recently, double-blind peer-review has fallen over my research community as a storm. Unfortunately, beyond its noble goal of reducing unfairness, double-blind peer-review may have detrimental collateral effects on open-science. >>> read more
May 30, 2017: How to use LuaTeX and LuaLaTeX on arXiv?
The arXiv open-access archive only supports pdftex/pdflatex. If you like using LuaLaTeX and all its wonderful features, it is very likely that you won't be able to compile it identically using pdflatex. >>> read more
May 22, 2017: Copy bibtex to clipboard in Google Scholar
I am addicted to Google Scholar. It has found so many excellent papers related to my research! However, there is one thing for which it sucks: Bibtex export. >>> read more
December 05, 2016: How to receive email notifications about new Arxiv papers?
How to receive notifications about new Arxiv papers in software engineering and programming languages? Email You have to write an email to cs@arxiv.org, with subject subscribe your name and email body: add Programming Languages add Software Engineering The official (and unclear) doc is https://arxiv. >>> read more
September 01, 2016: Makitoo: la réparation à chaud du labo à la startup
Makitoo est une startup issue des travaux de recherche en génie logiciel menés au laboratoire CRISTAL et Inria Lille, dans l'équipe Spirals. Créée en mars 2016, Makitoo propose des solutions de monitoring logiciel couplées avec la réparation à chaud, le tout en production. >>> read more
August 11, 2016: The tasks of a publication chair
What are the duties of a publication chair in a conference? Here are some humble notes from my small experience and the pieces of advice of Francesca Lonetti. >>> read more
August 01, 2016: Interesting pointers about open software science
awesome-open-science-software lists resources about open science and software: software as main experimental tool (aka computational methods) software as main study object (software engineering, programming language, systems software, ...). >>> read more
June 02, 2016: Python script to create an RSS feed from DBLP
Creates an RSS feed with DBLP JSON data from its API. #!/usr/bin/python2.7 """ Creates an RSS feed with DBLP JSON data from its API. Meant to be used as CGI script but can also be used as command-line. Feedback and pull request welcome. >>> read more
April 23, 2016: Open-science and Travis: continuous integration for reproducible scientific experiments
BLUF: one can use the concepts from continuous integration for achieving nearly perfectly reproducible computational experiments. A continuous integration server is a server that compiles and runs automated tests for each commit of a software project. >>> read more
February 13, 2016: How to write a good author response?
Peer-review has many different forms. One classical form is the journal publication process, where authors submit articles, reviewers comment on it, and the authors are invited to submit a revised version based on the comments. >>> read more
January 27, 2016: Lab notes on compiling genprog
Here are some notes on the Genprog automatic repair tool. Note that Genprog is now on Github: https://github.com/squaresLab/genprog-code Compiling version SVN 1688 genprog-code-snapshot-svn1688.tar.gz First compile CIL wget http://downloads.sourceforge.net/project/cil/cil/cil-1.7.3.tar. >>> read more
January 15, 2016: Required qualifications for research
Here are the key qualifications for successful research. be really excited about your project be ok working hard be persistent (get back up and continue when things don't work out as planned because true research rarely works out as planned) be fearless (e.g. >>> read more
November 24, 2015: Open-science and code sharing
In my software engineering research, I'm often curious about the prototypes and experiments of others, I want to know more, beyond what's written in the paper, I want to try, explore and experiment myself. >>> read more
November 23, 2015: Antifragile Software and Chaos Engineering
Antifragile software in a nutshell Antifragile software engineering is the art of building ultra dependable software systems running in ultra open and unpredictable environments. >>> read more
October 16, 2015: Allopatric software diversification
In ecology, allopatric diversification is when the same population is split in two different locations and diversification happens after the split. In this post, we present the concept of “allopatric software diversification”, directly inspired from the ecological concept. >>> read more
September 01, 2015: Martin's Collection of Bug Stories
I work on automatic repair of software bugs. I love good bugs stories. Classical (catastrophic) These are even meme. >>> read more
June 27, 2015: Un flux RSS pour les postes de Galaxie
En France, les postes académiques (Maître de conférence, Professeur) des Universités dépendantes du Ministère de l'Enseignement Supérieur sont publiés sur un site nommé Galaxie. >>> read more
June 23, 2015: A Monte-Carlo study of Spearman correlation coefficient
This is small Python program to study Spearman correlation coefficient in order to find/validate the critical values. >>> read more
May 07, 2015: The "Revise and resubmit as new" option
Recently, as author and reviewer, I discovered that some academic journals use a new kind of notification "Revise and resubmit as new". >>> read more
May 02, 2015: Graphical Understanding of Cohen's d Effect Size
I've just read "A systematic review of effect size in software engineering experiments" (IST, 2007). The authors say that in software engineering, an effect size is considered as: small if Cohen's d is approx. 0.17 medium if Cohen's d is approx. 0.60 large if Cohen's d is approx. 1. >>> read more
October 16, 2014: Automatic software repair tools
This page lists automatic software repair tools. The publicly available ones allow comparative and reproducible open research. If you are aware of other ones, please send me an email. >>> read more
October 15, 2014: A Discussion on Antifragile Software
Questions by Russ Miller and Bett Correa. Answers by Martin Monperrus, based on the paper "Principles of Antifragile Software" Please explain what is meant by Antifragile by Taleb?Antifragile by Taleb is a radically new perspective on errors. >>> read more
June 26, 2014: Automatic software repair, by Martin Monperrus
I've been working on automatic software repair since 2011. I maintain this curated list of pointers for helping newcomers to understand the field. –Martin In a nutshell A tentative definition of the field: Automatic software repair is the process of fixing software bugs automatically. >>> read more
March 28, 2014: Empirical Software Engineering Research: Methodology Papers
Here are important papers about empirical research in software engineering. >>> read more
March 26, 2014: Empirical studies on bugs
I love bugs. I work on automatic software repair. Hence, I am constantly looking for empirical knowledge on bugs. Here are the main papers on this topic. I you know more of them, please drop me an email. >>> read more
January 03, 2014: Introduction to Self-healing Software
My group is inventing concepts and systems for achieving self-healing software. Here is a curated list of pointers to understand the field. –Martin Self-healing software in a nutshell A software system is said to be self-healing if it is capable to handle unexpected error conditions. >>> read more
November 12, 2013: How to write a good scientific review
A review is a critical evaluation of a paper. In a usual peer-review process, a review is meant to be read: by the editor (or the PC chair) and by the authors. The editor/chair uses it to accept/reject the paper, the authors get a wealth of information to improve their paper. >>> read more
June 20, 2013: Companion Web Page for "Empirical Evidence of Large-Scale Diversity in API Usage of Object-Oriented Software"
Raw Dataset: Content: 3 418 Jar files, which include 382 774 different types (classes or interfaces). Source: All Jar files present on a machine used for performing software mining experiments for 7 years. URL: http://ecologyse.gforge.inria.fr/dataset-jars-scam2013.zip (1. >>> read more
April 16, 2013: Six Golden Rules for Writing, Using and Sharing Research Prototypes
A research prototype demonstrates an idea, collects some data for sake of writing a scientific paper. When it's software, it must be considered as a special kind of software. >>> read more
December 31, 2012: The 1000-index: a bibliometric for outstanding scholars
What is the 1000-index? This post proposes a new bibliographic metric, defined as follows: The 1000-index is the number of papers cited at least a thousand times. Google has invented the expression "i10-index" (the number of publications with at least 10 citations). >>> read more
November 25, 2012: Publication lists with HAL and bibtexbrowser
Hal is an open-access archive, similar to Arxiv. It is funded and maintained by French institutions such as CNRS and INRIA. Researchers of French government-funded research bodies are encouraged to provide the world with open-access versions of their papers through HAL. >>> read more
October 31, 2012: How to Do Good Scientific Research?
Introduction "The great thing about the research world is that you get to choose your environment, which consists in large measure of the members of your network. >>> read more
May 25, 2012: Companion Web Page for "Abmash: Mashing Up Legacy Web Applications by Automated Imitation of Human Actions"
Demo Code and Documentation See http://github. >>> read more
April 11, 2012: Pointers on abstract syntax tree differencing algorithms and tools
This post presents papers and tools on semantic source code differencing. It is a special kind of tree differencing. Unix diff and successors (CVS, GIT diff) are line-based. On the contrary, semantic source code diff work on the abstract syntax tree (AST) [1,2,3,4,5,6,11]. >>> read more
March 26, 2012: A Comparison of Median and Mean
Mean and median are two measures to summarize a data set of N numerical values. However, there are not equivalent. Here is a thorough comparison of their properties. >>> read more
February 13, 2012: Sharing scientific datasets with BitTorrent
Bittorrent is an appropriate technology to share scientific datasets because it relieves web servers of distributing many copies of large files and enables a kind of replication. This documents presents how to share scientific datasets with bitorrents. First, you have to set a tracker and a seeder. >>> read more
February 09, 2012: Sciclomatic: A Peer-to-Peer System for Sharing Scientific Datasets
This post sketches a peer-to-peer system for sharing scientific datasets. Permalink: http://www.monperrus.net/martin/sciclomatic-sharing-scientific-datasets Introduction Academics, students and researchers obtain, create or use data in their experiments. >>> read more
February 09, 2012: The CVS-Vintage Dataset of 14 CVS repositories
CVS-Vintage is a dataset of 14 CVS repositories of Java applications. Paper CVS-Vintage: A Dataset of 14 CVS Repositories of Java Software Dataset Light version -- only Java files: http://hal.archives-ouvertes.fr/docs/00/76/91/21/ANNEX/cvs-vintage-light-v20120509.tar. >>> read more
October 14, 2011: Companion Web Page for "Detecting Missing Method Calls As Violations of the Majority Rule"
This page is the companion web page for "[[http://www.monperrus.net/martin/bibtexbrowser.php?key=Monperrus2011b&bib=monperrus.bib|Detecting Missing Method Calls As Violations of the Majority Rule]]" (Martin Monperrus and Mira Mezini). >>> read more
December 17, 2010: RSS feeds for software engineering publications
Here are the main RSS feeds for academic publications related to software engineering. The inclusion in the feed depends on the publisher reputation (currently the major academic publishers) and on the ISI Web of Knowledge bibliometrics. Raw data in JSON: https://github.com/monperrus/academia. >>> read more
December 12, 2010: Graphical Visualization for Understanding the f1-score
F1 is a standard evaluation metric from information retrieval research. It combines the precision and the recall. In order to understand this combination, here is a visualization of the landscape of the F1-score. >>> read more
December 09, 2010: Three statistical formulas for estimating a proportion
This document presents three statistical formulas that give the margin of errors when estimating a proportion, and a piece of code to empirically verifies the formulas. It uses an unusual programming manner for discussing statistics (in contrast to pure maths). >>> read more
October 27, 2009: Accurate bibliographic metadata and google scholar
Disclaimer: the information in this post is a guess and does not work deterministically. It seems that the [[http://scholar.google.com/intl/en/scholar/inclusion. >>> read more
October 27, 2009: Academic publishers and bibliographic metadata
Here is a survey on the use of embedded bibliographic metadata by the main academic publishers. The data was collected on 2009-10-24. Please tell me if something has changed. >>> read more
July 14, 2009: Creating proceedings from pdf files with latex
As publication chair of MDPLE'2009, I had to create the proceedings of the workshop. >>> read more
September 11, 2007: bibtexbrowser: publication lists with bibtex and PHP
bibtexbrowser is a PHP script that creates publication lists from Bibtex files. bibtexbrowser is stable, mature and easy to install. It is used in hundreds of different universities and institutions (over 469 different domains according to Google). >>> read more