Support -mpcu flag and autodetect to trigger vectorization on proper vector lengths #588

nicolasvasilache · 2018-07-26T15:33:00Z

This PR passes proper llvm::TargetMachine information in
llvm_jit and codegen_llvm by introducing a proper TargetMachine
at the LLVMJit level and avoids introducing adhoc objects.

The TargetMachine is constructed either from the --mcpu flag if
passed or from the cpuid information.

As a consequence of all this, one can now emit AVX2 and AVX512 code.
Before this commit, the TargetMachine was essentially a default one
and only AVX code would be generated.

To test and see it one can run with:

cd build && \
make -j 16 test_mapper_llvm && \
./test/test_mapper_llvm --logtostderr=1 --llvm_dump_asm=1 --llvm_dump_after_opt=1 --llvm_dump_before_opt=1 --gtest_filter="*Batch*" --mcpu=skylake

Of course if one forces a more fancy architecture than one has,
illegal instructions will likely be generated but at least the asm
will be printed properly.

It will be reused in another location in the next commit.

We are now on trunk this was leftover from Tapir days.

This commit introduces a --llvm_dump_asm flag and the corresponding --llvm_dump_asm_options to emit assembly to LOG(INFO)

This commit adds the --mcpu flag to TC so we can emit LLVM and asm for different architectures.

This commit introduces and uses cpuid information to pass the proper llc `mcpu` flag.

This commit passes proper llvm::TargetMachine information in llvm_jit and codegen_llvm by introducing a proper TargetMachine at the LLVMJit level and avoids introducing adhoc objects. The TargetMachine is constructed either from the `--mcpu` flag if passed or from the `cpuid` information. As a consequence of all this, one can now emit AVX2 and AVX512 code. Before this commit, the TargetMachine was essentially a default one and only AVX code would be generated. To test and see it one can run with: ``` cd build && \ make -j 16 test_mapper_llvm && \ ./test/test_mapper_llvm --logtostderr=1 --llvm_dump_asm=1 --llvm_dump_after_opt=1 --llvm_dump_before_opt=1 --gtest_filter="*Batch*" --mcpu=skylake ``` Of course if one forces a more fancy architecture than one has, illegal instructions will likely be generated but at least the asm will be printed properly.

This commit avoids leaking all the guts of LLVMCodegen to the emitLLVMKernel function and restructures some code.

facebook-github-bot · 2018-08-17T21:08:30Z

Thank you for your pull request. We require contributors to sign our Contributor License Agreement, and yours has expired.

Before we can review or merge your code, we need you to email [email protected] with your details so we can update your status.

nicolasvasilache added 4 commits July 25, 2018 06:30

Extract checkedSystemCall function

7b0bd2c

It will be reused in another location in the next commit.

Cleanup non-trunk LLVM remnants

df64aee

We are now on trunk this was leftover from Tapir days.

Add support for printing emitted assembly

886b563

This commit introduces a --llvm_dump_asm flag and the corresponding --llvm_dump_asm_options to emit assembly to LOG(INFO)

Add mcpu flag for future use

0079a95

This commit adds the --mcpu flag to TC so we can emit LLVM and asm for different architectures.

nicolasvasilache requested review from wsmoses, ftynse and skimo-openhub July 26, 2018 15:33

facebook-github-bot added the CLA Signed label Jul 26, 2018

nicolasvasilache added 3 commits July 27, 2018 02:52

Introduce CPUID

ed0d65d

This commit introduces and uses cpuid information to pass the proper llc `mcpu` flag.

Refactor LLVMCodegen

0caa810

This commit avoids leaking all the guts of LLVMCodegen to the emitLLVMKernel function and restructures some code.

nicolasvasilache force-pushed the pr/more-cpu-mapper branch from c6e1989 to 0caa810 Compare July 27, 2018 09:53

nicolasvasilache changed the title ~~Pr/more cpu mapper~~ Support -mpcu flag and autodetect to trigger vectorization on proper vector lengths Jul 27, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support -mpcu flag and autodetect to trigger vectorization on proper vector lengths #588

Support -mpcu flag and autodetect to trigger vectorization on proper vector lengths #588

nicolasvasilache commented Jul 26, 2018 •

edited

Loading

facebook-github-bot commented Aug 17, 2018

Support -mpcu flag and autodetect to trigger vectorization on proper vector lengths #588

Are you sure you want to change the base?

Support -mpcu flag and autodetect to trigger vectorization on proper vector lengths #588

Conversation

nicolasvasilache commented Jul 26, 2018 • edited Loading

facebook-github-bot commented Aug 17, 2018

nicolasvasilache commented Jul 26, 2018 •

edited

Loading