Abstract is missing.
- Computing Everywhere, All at Once: Harnessing the Computing Continuum for ScienceManish Parashar. [doi]
- Modern AI for Analyzing Large Structured Databases: Opportunities and ChallengesSunita Sarawagi. [doi]
- High Performance and Energy Efficient Processor for Next Generation Data Centres: FUJITSU - MONAKAPriyanka Sharma. [doi]
- Addressing Exponential Scale Problems at InfosysVittal Setty. [doi]
- DNA-TEQ: An Adaptive Exponential Quantization of Tensors for DNN InferenceBahareh Khabbazan, Marc Riera, Antonio González 0001. 1-10 [doi]
- PARAG: PIM Architecture for Real-Time Acceleration of GCNsGian Singh, Sanmukh R. Kuppannagari, Sarma B. K. Vrudhula. 11-20 [doi]
- Hybrid CUDA Unified Memory Management in Fully Homomorphic Encryption WorkloadsJake Choi, Jaejin Lee, Sunchul Jung, Heon Young Yeom. 21-30 [doi]
- Mobile Gaming Experience: An Approach Based on Thread Scheduler & Thread Priority ManagerShaik Jani Basha, Sandani Shaik, Nazrinbanu Nagori, Veerendra Shetty. 31-40 [doi]
- Optimized All-to-All Connection Establishment for High-Performance MPI Libraries Over InfiniBandShulei Xu, Goutham Kalikrishna Reddy Kuncham, Mustafa Abduljabbar, Hari Subramoni, Dhabaleswar K. D. K. Panda. 41-50 [doi]
- MOSAIC: A Multi-Objective Optimization Framework for Sustainable Datacenter ManagementSirui Qi, Dejan S. Milojicic, Cullen E. Bash, Sudeep Pasricha. 51-60 [doi]
- 23D eDRAM TensorCore Architecture for Large-scale Matrix MultiplicationMengtian Yang, Yipeng Wang 0017, Jaydeep P. Kulkarni. 61-65 [doi]
- Contour Algorithm for ConnectivityZhihui Du, Oliver Alvarado Rodriguez, Fuhuan Li, Mohammad Dindoost, David A. Bader. 66-75 [doi]
- CAPTURE: Memory-Centric Partitioning for Distributed DNN Training with Hybrid ParallelismHenk Dreuning, Kees Verstoep, Henri E. Bal, Rob V. van Nieuwpoort. 76-86 [doi]
- MiCRO: Near-Zero Cost Gradient Sparsification for Scaling and Accelerating Distributed DNN TrainingDaegun Yoon, Sangyoon Oh 0001. 87-96 [doi]
- Understanding Patterns of Deep Learning Model Evolution in Network Architecture SearchRobert Underwood, Meghana Madhyastha, Randal C. Burns, Bogdan Nicolae. 97-106 [doi]
- Flover: A Temporal Fusion Framework for Efficient Autoregressive Model Parallel InferenceJinghan Yao, Nawras Alnaasan, Tian Chen, Aamir Shafi, Hari Subramoni, Dhabaleswar K. D. K. Panda. 107-116 [doi]
- Characterization and Detection of Artifacts for Error-Controlled Lossy CompressorsPu Jiao, Sheng Di, Jinyang Liu, Xin Liang 0001, Franck Cappello. 117-126 [doi]
- Performance Characterization of Containerized DNN Training and Inference on Edge AcceleratorsPrashanthi S. K, Vinayaka Hegde, Keerthana Patchava, Ankita Das, Yogesh Simmhan. 127-131 [doi]
- SECRE: Surrogate-Based Error-Controlled Lossy Compression Ratio Estimation FrameworkArham Khan, Sheng Di, Kai Zhao 0008, Jinyang Liu, Kyle Chard, Ian T. Foster, Franck Cappello. 132-142 [doi]
- Fast Algorithms for Scientific Data CompressionTania Banerjee, Jaemoon Lee, Jong Choi 0001, Qian Gong, Jieyang Chen, Scott Klasky, Anand Rangarajan 0001, Sanjay Ranka. 143-152 [doi]
- CAPIO: a Middleware for Transparent I/O Streaming in Data- Intensive WorkflowsAlberto Riccardo Martinelli, Massimo Torquati, Marco Aldinucci, Iacopo Colonnelli, Barbara Cantalupo. 153-163 [doi]
- JASS: A Tunable Checkpointing System for NVM-Based SystemsAkshin Singh, Smruti R. Sarangi. 164-173 [doi]
- Multi-Streamed Metadata-Integrity Verification For Cloud Migration In Deduplication SystemsShashank Khobragade, Santi Gopal Mondal, Kalyan Gunda. 174-178 [doi]
- CPU-GPU Tuning for Modern Scientific Applications using Node-Level HeterogeneityMathialakan Thavappiragasam, Vivek Kale. 179-183 [doi]
- DDIOSim: A Microarchitecture Simulator for Data Direct I/O TechnologyHari Sharan, Mythili Vutukuru, Biswabandan Panda. 184-188 [doi]
- FPGA Accelerated Bi-Cubic Convolution for Image InterpolationAnkit Choudhary, S. K. Vaibhav Kodavati, B. Mythili, R. V. G. Anjaneyulu, Manju Sarma. M. 189-193 [doi]
- DeltaSPARSE: High-Performance Sparse General Matrix-Matrix Multiplication on Multi-GPU SystemsShuai Yang, Changyou Zhang, Ji Ma. 194-202 [doi]
- Strategies for Fast I/O Throughput in Large-Scale Climate Modeling ApplicationsKoushik Sen, Sathish Vadhiyar, P. N. Vinayachandran. 203-212 [doi]
- ME- ViT: A Single-Load Memory-Efficient FPGA Accelerator for Vision TransformersKyle Marino, Pengmiao Zhang, Viktor K. Prasanna. 213-223 [doi]
- Graph Pattern Mining Paradigms: Consolidation and Renewed BearingVinícius Vitor dos Santos Dias, Samuel Ferraz, Aditya Vadlamani, Mahdi Erfanian, Carlos H. C. Teixeira, Dorgival O. Guedes, Wagner Meira Jr., Srinivasan Parthasarathy 0001. 224-233 [doi]
- Accelerating Time to Science using CRADLE: A Framework for Materials Data ScienceArafath Nihar, Thomas G. Ciardi, Rounak Chawla, Olatunde Akanbi, Vipin Chaudhary, Yinghui Wu, Roger H. French. 234-245 [doi]
- Optimizing the Training of Co-Located Deep Learning Models Using Cache-Aware StaggeringKevin Assogba, Bogdan Nicolae, M. Mustafa Rafique. 246-255 [doi]
- Towards Efficient I/O Pipelines Using Accumulated CompressionAvinash Maurya, Bogdan Nicolae, M. Mustafa Rafique, Franck Cappello. 256-265 [doi]
- Oikonomos-II: A Reinforcement-Learning, Resource-Recommendation System for Cloud HPCJan-Harm L. F. Betting, Chris I. De Zeeuw, Christos Strydis. 266-276 [doi]
- SCoOL - Scalable Common Optimization LibraryZainul Abideen Sayed, Jaroslaw Zola. 277-287 [doi]
- Data Locality Aware Computation Offloading in Near Memory Processing Architecture for Big Data ApplicationsSatanu Maity, Mayank Goel, Manojit Ghose. 288-297 [doi]
- Benesh: a Framework for Choreographic Coordination of In Situ WorkflowsPhilip E. Davis, Jacob Merson, Pradeep Subedi, Lee F. Ricketson, Cameron W. Smith, Mark S. Shephard, Manish Parashar. 298-308 [doi]
- Profit Maximization Using Collaborative Storage Management in Multi-Tier Edge-Cloud SystemShubhradeep Roy, Suvarthi Sarkar, Aryabartta Sahu. 309-318 [doi]
- Towards Enhanced I/O Performance of NVM File SystemsJiwoo Bang, Chungyong Kim, Eun-Kyu Byun, Hanul Sung, Jaehwan Lee 0001, Hyeonsang Eom. 319-323 [doi]
- Fast Parallel Tensor Times Same Vector for HypergraphsShruti Shivakumar, Ilya Amburg, Sinan G. Aksoy, Jiajia Li 0001, Stephen J. Young, Srinivas Aluru. 324-334 [doi]
- Reduce, Reuse, and Adapt: Accelerating Graph Processing on GPUsUllas A, Rupesh Nasre, R. Govindarajan. 335-346 [doi]
- Reduce Computational Complexity for Convolutional Layers by Skipping ZerosZhiyi Zhang, Pengfei Zhang, Zhuopin Xu, Qi Wang. 347-356 [doi]
- SpikeNC: An Accurate and Scalable Simulator for Spiking Neural Network on Multi-Core Neuromorphic HardwareLisheng Xie, Jianwei Xue, Liangshun Wu, Faquan Chen, Qingyang Tian, Yifan Zhou, Rendong Ying, Peilin Liu. 357-366 [doi]
- DAGit: A Platform For Enabling Serverless ApplicationsAnubhav Jana, Purushottam Kulkarni, Umesh Bellur. 367-376 [doi]
- Efficient GPU Implementation of Automatic Differentiation for Computational Fluid DynamicsMohammad Zubair, Desh Ranjan, Aaron Walden, Gabriel Nastac, Eric J. Nielsen, Boris Diskin, Marc F. Paterno, Samuel Jung, Joshua Hoke Davis. 377-386 [doi]
- A Lossless Compression Pipeline for Petabyte-Scale Whole Genome Sequencing DataAjeya Bhat, Sai Manasa Chadalavada, Nagakishore Jammula, Chirag Jain, Yogesh Simmhan. 387-391 [doi]