BenchGen

Benchmark Generation via L-Systems

About:

BenchGen implements an L-System to generate programs from a seed string and a set of production rules, enabling the creation of large programs through iterative expansion of the L-System. The program generator is written entirely in C++. To learn more about how BenchGen works, see the brief report about it.

BenchGen is developed in the Compilers Lab and is funded by FAPEMIG and Google. We appreciate their support and contributions to the development of this project.

Releases:

• C Language Support:

v1.0.0-alpha

• Multi Language Support:

v1.0.0-beta

Installing and Running:

After cloning the repository, build the project by running the make command in the src/gen directory. Note that clang++ is the default compiler.

To run BenchGen, you need to provide five parameters, as in the examples below. The beta version takes a sixth parameter, shown as programmingLanguage in its example.

Running the alpha version:

git clone --branch v1.0.0-alpha https://github.com/lac-dcc/BenchGen.git
make -C ./BenchGen/src/gen/

./benchGen 1 productionRule.txt seedString.txt myProgram array

Running the beta version:

git clone --branch v1.0.0-beta https://github.com/lac-dcc/BenchGen.git
make -C ./BenchGen/src/gen/

./benchGen 1 productionRule.txt seedString.txt myProgram array programmingLanguage

See more: here

How to create a benchmark via L-grammar

The L-system grammar used by BenchGen currently supports three commands:

IF – conditional statements (if-then-else)
LOOP – iterative structures (for and while)
CALL – function calls

BenchGen also employs data structures in its benchmarks, providing four functions for data manipulation:

new: Creates and initializes a data structure.
insert: Adds an element to a data structure in scope.
remove: Deletes an element from a data structure in scope.
contains: Checks whether an element exists in a data structure in scope.

Based on this, we can define production rules that describe how the benchmark will be structured:

Axiom

CALL(new A)

Production Rules

A = LOOP(B C contains);
B = IF(new LOOP(remove contains), new A remove contains);
C = new contains B;

This program, for instance, begins with a function call whose body creates a new data structure using the new command. Rule A then produces a loop whose body expands rules B and C and checks the structure with the contains command. Because rule B refers back to A, each additional iteration of the L-System nests further control structures inside the program.

For more details, see Section 3.1 of our technical report, which describes the syntax of the control structures.

Contributing

BenchGen, in its beta version, supports generating benchmarks in multiple programming languages. To contribute support for a new programming language, follow this documentation, which explains step by step how to add your language and use it with BenchGen.
To contribute to the alpha version of BenchGen, see our open issues, where you can help us make BenchGen even better.
You can also report bugs by opening an issue here.

Fork on GitHub

Team

Vinicius Francisco da Silva - Federal University of Minas Gerais (UFMG)
Heitor Leite - Federal University of Minas Gerais (UFMG)
Fernando Magno Quintão Pereira - Federal University of Minas Gerais (UFMG)