Modern CPUs provide single instruction-multiple data (SIMD) instructions. SIMD instructions process several elements of a primitive data type simultaneously in fixed-size vectors. Classical sorting algorithms are not directly expressible in SIMD instructions. Accelerating sorting algorithms with SIMD instruction is therefore a creative endeavor. A promising approach for sorting with SIMD instructions is to use sorting networks for small arrays and Quicksort for large arrays. In this paper we improve vectorization techniques for sorting networks and Quicksort. In particular, we show how to use the full capacity of vector registers in sorting networks and how to make vectorized Quicksort robust with respect to different key distributions. To demonstrate the performance of our techniques we implement an in-place hybrid sorting algorithm for the data type int with AVX2 intrinsics. Our implementation is at least 30% faster than state-of-the-art high-performance sorting alternatives.


    Zugriff

    Download


    Exportieren, teilen und zitieren



    Titel :

    Fast and Robust Vectorized In-Place Sorting of Primitive Types


    Beteiligte:
    Blacher, Mark (Autor:in) / Giesen, Joachim (Autor:in) / Kühne, Lars (Autor:in)

    Kongress:

    2021 ; Online



    Erscheinungsdatum :

    2021-06-01



    Medientyp :

    Aufsatz (Konferenz)


    Format :

    Elektronische Ressource


    Sprache :

    Englisch




    Perfecting Vectorized Mechanical Drawings

    Chen, Y. / Langrana, N. A. / Das, A. K. | British Library Online Contents | 1996


    A vectorized solution for incompressible flow

    PATEL, N. / THOMPSON, J. | AIAA | 1984


    Feature Identification from Vectorized Mechanical Drawings

    Langrana, N. A. / Chen, Y. / Das, A. K. | British Library Online Contents | 1997


    AGENT TRAJECTORY PREDICTION USING VECTORIZED INPUTS

    GAO JIYANG / SHEN YI / ZHAO HANG et al. | Europäisches Patentamt | 2023

    Freier Zugriff

    AGENT TRAJECTORY PREDICTION USING VECTORIZED INPUTS

    GAO JIYANG / SHEN YI / ZHAO HANG et al. | Europäisches Patentamt | 2021

    Freier Zugriff