Abstract is missing.
- Four Decades of Cluster ComputingGerhard R. Joubert, Anthony J. Maeder. 3-8 [doi]
- Will We Ever Have a Quantum Computer?M. I. Dyakonov. 11-15 [doi]
- Empowering Parallel Computing with Field Programmable Gate ArraysErik H. D'Hollander. 16-31 [doi]
- First Experiences on Applying Deep Learning Techniques to Prostate Cancer DetectionEduardo José Gómez-Hernández, José Manuel García. 35-44 [doi]
- Deep Generative Model Driven Protein Folding SimulationsHeng Ma, Debsindhu Bhowmik, Hyungro Lee, Matteo Turilli, Michael T. Young, Shantenu Jha, Arvind Ramanathan. 45-55 [doi]
- A Scalable Approach to Econometric InferencePhilip Nadler, Rossella Arcucci, Yi-Ke Guo. 59-68 [doi]
- Cloud vs On-Premise HPC: A Model for Comprehensive Cost AssessmentMarco Ferretti, Luigi Santangelo. 69-80 [doi]
- GPU Architecture for Wavelet-Based Video Coding AccelerationCarlos de Cea-Dominguez, Juan C. Moure, Joan Bartrina-Rapesta, Francesc Aulí Llinàs. 83-92 [doi]
- GPGPU Computing for Microscopic Pedestrian SimulationBenedikt Zönnchen, Gerta Köster. 93-104 [doi]
- High Performance Eigenvalue Solver for Hubbard Model: Tuning Strategies for LOBPCG Method on CUDA GPUSusumu Yamada, Masahiko Machida, Toshiyuki Imamura. 105-113 [doi]
- Parallel Smoothers in Multigrid Method for Heterogeneous CPU-GPU EnvironmentNeha Iyer, Sashikumaar Ganesan. 114-123 [doi]
- Progressive Load Balancing in Distributed MemoryJusts Zarins, Michèle Weiland. 127-136 [doi]
- Learning-Based Load Balancing for Massively Parallel Simulations of Hot Fusion PlasmasTheresa Pollinger, Dirk Pflüger. 137-146 [doi]
- Load-Balancing for Large-Scale Soot Particle Agglomeration SimulationsSteffen Hirschmann, Andreas Kronenburg, Colin W. Glass, Dirk Pflüger. 147-156 [doi]
- On the Autotuning of Task-Based Numerical Libraries for Heterogeneous ArchitecturesEmmanuel Agullo, Jesús Cámara, Javier Cuenca, Domingo Giménez. 157-166 [doi]
- Batched 3D-Distributed FFT Kernels Towards Practical DNS CodesToshiyuki Imamura, Masaaki Aoki, Mitsuo Yokokawa. 169-178 [doi]
- On Superlinear Speedups of a Parallel NFA Induction AlgorithmTomasz Jastrzab. 179-188 [doi]
- A Domain Decomposition Reduced Order Model with Data Assimilation (DD-RODA)Rossella Arcucci, César Quilodrán Casas, Dunhui Xiao, Laetitia Mottet, Fangxin Fang, Pin Wu, Christopher C. Pain, Yi-Ke Guo. 189-198 [doi]
- Predicting Performance of Classical and Modified BiCGStab Iterative MethodsBoris I. Krasnopolsky. 199-206 [doi]
- Gadget3 on GPUs with OpenACCAntonio Ragagnin, Klaus Dolag, Mathias Wagner, Claudio Gheller, Conradin Roffler, David Goz, David Hubber, Alexander Arth. 209-218 [doi]
- Exploring High Bandwidth Memory for PET Image ReconstructionDai Yang, Tilman Küstner, Rami G. Al Rihawi, Martin Schulz 0001. 219-228 [doi]
- The Architecture of Heterogeneous Petascale HPC RIVRMiran Ulbin, Zoran Ren. 231-240 [doi]
- Design of an FPGA-Based Matrix Multiplier with Task ParallelismYiyu Tan, Toshiyuki Imamura, Daichi Mukunoki. 241-250 [doi]
- Application Performance of Physical System SimulationsVladimir Getov, Peter M. Kogge, Thomas M. Conte. 251-260 [doi]
- A Hybrid MPI+Threads Approach to Particle Group Finding Using Union-FindJames S. Willis, Matthieu Schaller, Pedro Gonnet, John C. Helly. 263-274 [doi]
- Improving the Scalability of the ABCD Solver with a Combination of New Load Balancing and Communication Minimization TechniquesIain S. Duff, Philippe Leleux, Daniel Ruiz 0002, F. Sukru Torun. 277-286 [doi]
- Characterization of Power Usage and Performance in Data-Intensive Applications Using MapReduce over MPIJoshua Hoke Davis, Tao Gao, Sunita Chandrasekaran, Heike Jagode, Anthony Danalis, Jack J. Dongarra, Pavan Balaji, Michela Taufer. 287-298 [doi]
- Feedback-Driven Performance and Precision Tuning for Automatic Fixed Point ExploitationDaniele Cattaneo, Michele Chiari, Stefano Cherubin, Antonio Di Bello, Giovanni Agosta. 299-308 [doi]
- A GPU-CUDA Framework for Solving a Two-Dimensional Inverse Anomalous Diffusion ProblemPasquale De Luca, Ardelio Galletti, Hadi Roohani Ghehsareh, Livia Marcellino, Marzie Raei. 311-320 [doi]
- Parallelization Strategies for GPU-Based Ant Colony Optimization Applied to TSPBreno Augusto De Melo Menezes, Luis Filipe de Araujo Pessoa, Herbert Kuchen, Fernando Buarque de Lima Neto. 321-330 [doi]
- DBCSR: A Blocked Sparse Tensor Algebra LibraryIlia Sivkov, Patrick Seewald, Alfio Lazzaro, Jürg Hutter. 331-340 [doi]
- Acceleration of Hydro Poro-Elastic Damage Simulation in a Shared-Memory EnvironmentHarel Levin, Gal Oren 0001, Eyal Shalev, Vladimir Lyakhovsky. 341-353 [doi]
- BERTHA and PyBERTHA: State of the Art for Full Four-Component Dirac-Kohn-Sham CalculationsLoriano Storchi, Matteo de Santis, Leonardo Belpassi. 354-363 [doi]
- Prediction-Based Partitions Evaluation Algorithm for Resource AllocationAnna Pupykina, Giovanni Agosta. 364-375 [doi]
- Unified Generation of DG-Kernels for Different HPC FrameworksJan Hönig, Marcel Koch, Ulrich Rüde, Christian Engwer, Harald Köstler. 376-385 [doi]
- Invasive Computing for Power Corridor ManagementJophin John, Santiago Narváez, Michael Gerndt. 386-395 [doi]
- Enforcing Reference Capability in FastFlow with RustLuca Rinaldi, Massimo Torquati, Marco Danelutto. 396-405 [doi]
- AITuning: Machine Learning-Based Tuning Tool for Run-Time Communication LibrariesAlessandro Fanfarillo, Davide Del Vento. 409-418 [doi]
- Towards Benchmarking the Asynchronous Progress of Non-Blocking MPI OperationsAlexey V. Medvedev. 419-428 [doi]
- Acceleration of Interactive Multiple Precision Arithmetic Toolbox MuPAT Using FMA, SIMD, and OpenMPHotaka Yagi, Emiko Ishiwata, Hidehiko Hasegawa. 431-440 [doi]
- Dynamic Runtime and Energy Optimization for Power-Capped HPC ApplicationsBo Wang, Christian Terboven, Matthias S. Müller. 441-452 [doi]
- Paradigm Shift in Program Structure of Particle-in-Cell SimulationsTakayuki Umeda. 455-464 [doi]
- Backus FP Revisited: A Parallel Perspective on Modern MulticoresAlessandro Di Giorgio, Marco Danelutto. 465-474 [doi]
- Multi-Variant User Functions for Platform-Aware Skeleton ProgrammingAugust Ernstsson, Christoph W. Kessler. 475-484 [doi]
- POETS: Distributed Event-Based Computing - Scaling BehaviourAndrew Brown, Mark Vousden, Alex Rast, Graeme Bragg, David B. Thomas, Jonny Beaumont, Matthew Naylor, Andrey Mokhov. 487-496 [doi]
- Towards High-End Scalability on Biologically-Inspired Computational ModelsDario Dematties, George K. Thiruvathukal, Silvio Rizzi, Alejandro Wainselboim, B. Silvano Zanutto. 497-506 [doi]
- GraphiX: A Fast Human-Computer Interaction Symmetric Multiprocessing Parallel Scientific Visualization ToolRe'em Harel, Gal Oren 0001. 509-520 [doi]
- When Parallel Performance Measurement and Analysis Meets In Situ Analytics and VisualizationAllen D. Malony, Matthew Larsen, Kevin A. Huck, Chad Wood, Sudhanshu Sane, Hank Childs. 521-530 [doi]
- Seamless Parallelism Management for Video Stream Processing on Multi-CoresAdriano Vogel, Dalvan Griebler, Luiz Gustavo Fernandes, Marco Danelutto. 533-542 [doi]
- High-Level Stream Parallelism Abstractions with SPar Targeting GPUsDinei A. Rockenbach, Dalvan Griebler, Marco Danelutto, Luiz Gustavo Fernandes. 543-552 [doi]
- Energy-Efficiency Evaluation of FPGAs for Floating-Point Intensive WorkloadsEnrico Calore, Sebastiano Fabio Schifano. 555-564 [doi]
- GPU Acceleration of Four-Site Water Models in LAMMPSVsevolod P. Nikolskiy, Vladimir V. Stegailov. 565-573 [doi]
- Energy Consumption of MD Calculations on Hybrid and CPU-Only Supercomputers with Air and Immersion CoolingEkaterina Dlinnova, Sergey Biryukov, Vladimir V. Stegailov. 574-582 [doi]
- Direct N-Body Application on Low-Power and Energy-Efficient Parallel ArchitecturesGeorgios Ieronymakis, Vassilis Papaefstathiou, Nikolaos Dimou, Sara Bertocco, Antonio Ragagnin, Luca Tornatore, Giuliano Taffoni, Igor Coretti. 583-592 [doi]
- Performance and Energy Efficiency of CUDA and OpenCL for GPU Computing Using PythonHåvard H. Holm, André R. Brodtkorb, Martin Lilleeng Sætra. 593-604 [doi]
- Computational Performances and Energy Efficiency Assessment for a Lattice Boltzmann Method on Intel KNLIvan Girotto, Sebastiano Fabio Schifano, Enrico Calore, Gianluca Di Staso, Federico Toschi. 605-613 [doi]
- Performance, Power Consumption and Thermal Behavioral Evaluation of the DGX-2 PlatformMatej Spetko, Lubomir Riha, Branislav Jansik. 614-623 [doi]
- On the Performance and Energy Efficiency of Sparse Matrix-Vector Multiplication on FPGAsPanagiotis Mpakos, Nikela Papadopoulou, Chloe Alverti, Georgios I. Goumas, Nectarios Koziris. 624-633 [doi]
- Evaluation of DVFS and Uncore Frequency Tuning Under Power Capping on Intel Broadwell ArchitectureLubomir Riha, Ondrej Vysocky, Andrea Bartolini. 634-643 [doi]
- ELPA: A Parallel Solver for the Generalized Eigenvalue ProblemHans-Joachim Bungartz, Christian Carbogno, Martin Galgon, Thomas Huckle, Simone S. Köcher, Hagen-Henrik Kowalski, Pavel Kus, Bruno Lang, Hermann Lederer, Valeriy Manin, Andreas Marek, Karsten Reuter, Michael Rippl, Matthias Scheffler, Christoph Scheurer. 647-668 [doi]
- Parallel Totally Induced Edge Sampling on FPGAsAkshit Goel, Sanmukh R. Kuppannagari, Yang Yang, Ajitesh Srivastava, Viktor K. Prasanna. 671-680 [doi]
- An Implementation of Non-Local Means Algorithm on FPGAHayato Koizumi, Tsutomu Maruyama. 681-690 [doi]
- Accelerating Binarized Convolutional Neural Networks with Dynamic Partial Reconfiguration on Disaggregated FPGAsPanagiotis Skrimponis, Emmanouil Pissadakis, Nikolaos Alachiotis, Dionisios N. Pnevmatikatos. 691-700 [doi]
- Porting a Lattice Boltzmann Simulation to FPGAs Using OmpSsEnrico Calore, Sebastiano Fabio Schifano. 701-710 [doi]
- A Processor Architecture for Executing Global Cellular Automata as SoftwareChristian Ristig, Christian Siemers. 711-720 [doi]
- Crossbar Implementation with Partial Reconfiguration for Stream Switching Applications on an FPGAYuichi Kawamata, Tomohiro Kida, Yuichiro Shibata, Kentaro Sano. 721-730 [doi]
- Replicating Machine Learning Experiments in Materials ScienceLine Pouchard, Yuewei Lin, Hubertus van Dam. 743-755 [doi]
- Documenting Computing Environments for Reproducible ExperimentsJason Chuah, Madeline Deeds, Tanu Malik, Youngdon Choi, Jonathan L. Goodall. 756-765 [doi]
- Toward Enabling Reproducibility for Data-Intensive Research Using the Whole Tale PlatformKyle Chard, Niall Gaffney, Mihael Hategan, Kacper Kowalik, Bertram Ludäscher, Timothy M. McPhillips, Jarek Nabrzyski, Victoria Stodden, Ian J. Taylor, Thomas Thelen, Matthew J. Turk, Craig Willis. 766-778 [doi]