Pages tagged with "academia" on monperrus.net

by Martin Monperrus
#11 A Comparison of Median and Mean (March 26, 2012)
Mean and median are two measures to summarize a data set of N numerical values. However, there are not equivalent. Here is a thorough comparison of their properties. >>> read more
#10 Sciclomatic: The Future of Sharing Scientific Datasets (February 09, 2012)
Martin Monperrus, February 2012 This post sketches a peer-to-peer system for sharing scientific datasets. Permalink: http://www.monperrus.net/martin/sciclomatic-sharing-scientific-datasets Introduction Academics, students and researchers obtain, create or use data in their experiments. >>> read more
#9 Conservation and Replication with CVS-Vintage: A Dataset of CVS Repositories of Java Software (February 09, 2012)
CVS-Vintage is a dataset of CVS repositories of Java applications. It contains the original CVS repository of 14 open source software projects. >>> read more
#8 Standup reading groups (January 16, 2012)
A reading group is a regular meeting in a research group or departement to discuss papers on a certain topic. A standup reading group is a kind of reading group that aims at being lively and easy to organize. Contrary to research seminars, they are not graded (but students are always welcome). >>> read more
#7 sejournals: a RSS and twitter feed for software engineering publications (December 17, 2010)
sejournals is a RSS and twitter feed for journal publications related to software engineering. It is available at http://pipes.yahoo.com/pipes/pipe.run?_id=12f9c0a1eb34af14ce6ad6a49b029121&_render=rss and http://twitter. >>> read more
#6 Understanding the f1-score (December 12, 2010)
F1 is a standard evaluation metric from information retrieval research. It combines the precision and the recall. In order to understand this combination, here is a visualization of the landscape of the F1-score. >>> read more
#5 Three statistical formulas for estimating a proportion (December 09, 2010)
This document presents three statistical formulas that give the margin of errors when estimating a proportion, and a piece of code to empirically verifies the formulas. It uses an unusual programming manner for discussing statistics (in contrast to pure maths). >>> read more
#4 Accurate bibliographic metadata and google scholar (October 27, 2009)
It is often the case that the metadata that are automatically extracted by Google Scholar (e.g. title, authors, etc) are incorrect. There is a way to give Google Scholar the correct bibliographic metadata of your publications. The official guidelines are published at http://scholar.google. >>> read more
#3 academic publishers and bibliographic metadata (October 27, 2009)
Here is a survey on the use of embedded bibliographic metadata by the main academic publishers. The data was collected on 2009-10-24. Please tell me if something has changed. >>> read more
#2 Creating proceedings from pdf files with latex (July 14, 2009)
As publication chair of MDPLE'2009, I had to create the proceedings of the workshop. >>> read more
#1 Pointers on tree differencing algorithms and tools (unknown)
This post presents papers and tools on tree differencing. Tree differencing has been studied in the context of semantic diff of source code, as well as in the context of XML engineering (comparing two XMLs is a tree diff). To a certain extent, the latter is a more generic form of the former. >>> read more