All Stories

  1. AIIO: Using Artificial Intelligence for Job-Level and Automatic I/O Performance Bottleneck Diagnosis
  2. I/O Access Patterns in HPC Applications: A 360-Degree Survey
  3. Real-time and post-hoc compression for data from Distributed Acoustic Sensing
  4. Understanding Parallel I/O Performance and Tuning
  5. PROV-IO
  6. Access Patterns and Performance Behaviors of Multi-layer Supercomputer I/O Subsystems under Production Load
  7. LBNL Superfacility Project Report
  8. Transparent Asynchronous Parallel I/O Using Background Threads
  9. Management and Storage of Scientific Data
  10. Position Papers for the ASCR Workshop on the Management and Storage of Scientific Data
  11. Optimizing Performance of Parallel I/O Accesses to Non-contiguous Blocks in Multiple Array Variables
  12. Tuning Parallel Data Compression and I/O for Large-scale Earthquake Simulation
  13. An In-Depth I/O Pattern Analysis in HPC Systems
  14. Exploiting user activeness for data retention in HPC systems
  15. Data-Aware Storage Tiering for Deep Learning
  16. I/O Bottleneck Detection and Tuning: Connecting the Dots using Interactive Log Analysis
  17. SCTuner: An Autotuner Addressing Dynamic I/O Needs on Supercomputer I/O Subsystems
  18. Characterizing Impacts of Storage Faults on HPC Applications: A Methodology and Insights
  19. Battle of the Defaults: Extracting Performance Characteristics of HDF5 under Production Load
  20. FasTensor Programming Model
  21. FasTensor User Interface
  22. FasTensor in Real Scientific Applications
  23. Introduction
  24. User-Defined Tensor Data Analysis
  25. GPU Direct I/O with HDF5
  26. Cross-facility science with the Superfacility Project at LBNL
  27. Towards HPC I/O Performance Prediction through Large-scale Log Analysis
  28. HPC Workload Characterization Using Feature Selection and Clustering
  29. Predicting and Comparing the Performance of Array Management Libraries
  30. Parallel Query Service for Object-centric Data Management Systems
  31. DASSA: Parallel DAS Data Storage and Analysis for Subsurface Event Detection
  32. Interfacing HDF5 with a scalable object‐centric storage system on hierarchical storage
  33. ExaHDF5: Delivering Efficient Parallel I/O on Exascale Computing Systems
  34. Final Technical Report - Proactive Data Containers for Scientific Storage
  35. Tuning Object-Centric Data Management Systems for Large Scale Scientific Applications
  36. Exploring Metadata Search Essentials for Scientific Data Management
  37. Analysis in the Data Path of an Object-Centric Data Management System
  38. MIQS
  39. Revisiting I/O behavior in large-scale storage systems
  40. Sparse Data Management in HDF5
  41. Understanding Data Motion in the Modern HPC Data Center
  42. Enabling Transparent Asynchronous I/O using Background Threads
  43. Active Learning-based Automatic Tuning and Prediction of Parallel I/O Performance
  44. Terabyte-scale Particle Data Analysis
  45. DCA-IO: A Dynamic I/O Control Scheme for Parallel and Distributed File Systems
  46. A Zoom-in Analysis of I/O Logs to Detect Root Causes of I/O Performance Bottlenecks
  47. Optimizing I/O Performance of HPC Applications with Autotuning
  48. Parallel membership queries on very large scientific data sets using bitmap indexes
  49. SLOPE: Structural Locality-Aware Programming Model for Composing Array Data Analysis
  50. Extreme Heterogeneity 2018 - Productive Computational Science in the Era of Extreme Heterogeneity: Report for DOE ASCR Workshop on Extreme Heterogeneity
  51. ARCHIE: Data Analysis Acceleration with Array Caching in Hierarchical Storage
  52. A Year in the Life of a Parallel File System
  53. DART
  54. Evaluation of HPC Application I/O on Object Storage Systems
  55. UniviStor: Integrated Hierarchical and Distributed Storage for HPC
  56. IOMiner: Large-Scale Analytics Framework for Gaining Knowledge from I/O Logs
  57. A Transparent Server-Managed Object Storage System for HPC
  58. Toward Scalable and Asynchronous Object-Centric Data Management for HPC
  59. ArrayBridge: Interweaving Declarative Array Processing in SciDB with Imperative HDF5-Based Programs
  60. Toward Transparent Data Management in Multi-Layer Storage Hierarchy of HPC Systems
  61. SoMeta: Scalable Object-Centric Metadata Management for High Performance Computing
  62. ArrayUDF
  63. UMAMI
  64. Data Elevator: Low-Contention Data Movement in Hierarchical Storage System
  65. Exploring memory hierarchy and network topology for runtime AMR data sharing across scientific applications
  66. In Situ Storage Layout Optimization for AMR Spatio-temporal Read Accesses
  67. Resolution dependence of precipitation statistical fidelity in hindcast simulations
  68. SDS-Sort
  69. AMRZone: A Runtime AMR Data Sharing Framework for Scientific Applications
  70. PANDA: Extreme Scale Parallel K-Nearest Neighbor on Distributed Architectures
  71. Usage Pattern-Driven Dynamic Data Layout Reorganization
  72. Characterization of extreme precipitation within atmospheric river events over California
  73. BD-CATS
  74. Security for the scientific data services framework
  75. Spatially clustered join on heterogeneous scientific data sets
  76. Collective Computing for Scientific Big Data Analysis
  77. Dynamic Model-Driven Parallel I/O Performance Tuning
  78. A study of file system read and write behavior on supercomputers
  79. Parallel In Situ Detection of Connected Components in Adaptive Mesh Refinement Data
  80. TECA: Petascale Pattern Recognition for Climate Science
  81. Pattern-driven parallel I/O tuning
  82. Heavy-tailed distribution of parallel I/O system response time
  83. Managing scientific data with named data networking
  84. Techniques for modeling large-scale HPC I/O workloads
  85. A multi-domain SDN for dynamic layer-2 path service
  86. Simplifying index file structure to improve I/O performance of parallel indexing
  87. Towards Energy Awareness in Hadoop
  88. NDM'14 Chairs' Welcome
  89. Adaptation and Policy-Based Resource Allocation for Efficient Bulk Data Transfers in High Performance Computing Environments
  90. Analysis of the Effect of Core Affinity on High-Throughput Flows
  91. Flexible Scheduling and Control of Bandwidth and In-transit Services for End-to-End Application Workflows
  92. Towards Managed Terabit/s Scientific Data Flows
  93. Parallel query evaluation as a Scientific Data Service
  94. Parallel data analysis directly on scientific file formats
  95. Model-Driven Data Layout Selection for Improving Read Performance
  96. Improving parallel I/O autotuning with performance modeling
  97. SDS
  98. Taming parallel I/O complexity with auto-tuning
  99. Segmented analysis for reducing data movement
  100. Expediting scientific data analysis with reorganization of data
  101. Optimizing fastquery performance on lustre file system
  102. A framework for auto-tuning HDF5 applications
  103. Why high performance visual data analytics is both relevant and difficult
  104. Characterizing the impact of end-system affinities on the end-to-end performance of high-speed flows
  105. Abstract: Auto-Tuning of Parallel IO Parameters for HDF5 Applications
  106. NDM 2012: Second International Workshop on Network-Aware Data Management
  107. Parallel I/O, analysis, and visualization of a trillion particle simulation
  108. Boosting Application-Specific Parallel I/O Optimization Using IOSIG
  109. TECA: A Parallel Toolkit for Extreme Climate Analysis
  110. Energy-Aware Workload Consolidation on GPU
  111. Special issue on Data Intensive Computing
  112. Detecting atmospheric rivers in large climate datasets
  113. Open problems in network-aware data management in exa-scale computing and terabit networking era
  114. Scientific data services
  115. Best-effort semantic document search on GPUs
  116. Data-aware scheduling of legacy kernels on heterogeneous platforms with distributed memory
  117. Exploiting the forgiving nature of applications for scalable parallel execution
  118. Special Issue of the Journal of Parallel and Distributed Computing: Data-Intensive Computing
  119. Core-aware memory access scheduling schemes
  120. Taxonomy of Data Prefetching for Multicore Processors
  121. Modeling Data Access Contention in Multicore Architectures
  122. Hiding I/O latency with pre-execution prefetching for parallel applications
  123. Parallel I/O prefetching using MPI file caching and I/O signatures
  124. Exploring Parallel I/O Concurrency with Speculative Prefetching
  125. A Taxonomy of Data Prefetching Mechanisms
  126. Server-Based Data Push Architecture for Multi-Processor Environments
  127. Improving Data Access Performance with Server Push Architecture
  128. Data access history cache and associated data prefetching mechanisms
  129. Memory Servers: A Scope of SOA for High-End Computing
  130. Automatic Memory Optimizations for Improving MPI Derived Datatype Performance
  131. ISOLATING COSTS IN SHARED MEMORY COMMUNICATION BUFFERING
  132. Quantification of Memory Communication
  133. Improving the performance of MPI derived datatypes by optimizing memory-access cost
  134. Predicting memory-access cost based on data-access patterns