All Stories

  1. Efficient Neural Networks on the Edge with FPGAs by Optimizing an Adaptive Activation Function
  2. Artifact Evaluation for ACM TRETS Papers Submitted from the FPT Journal Track
  3. FPGA-aware automatic acceleration framework for vision transformer with mixed-scheme quantization
  4. Strategies and Demonstration to Support Multiple Wireless Protocols with a Single RF Front-End
  5. A Novel Physical Layer Authentication With PAPR Reduction Based on Channel and Hardware Frequency Responses
  6. Real Time Receiver Baseband Processing Platform for Sub 6 GHz PHY Layer Experiments
  7. SIFO: Secure Computational Infrastructure Using FPGA Overlays
  8. QuTiBench
  9. Garbled Circuits in the Cloud using FPGA Enabled Nodes
  10. Detection of Different Wireless Protocols on an FPGA with the Same Analog/RF Front End
  11. High-Level and Compact Design of Cross-Channel LTE DownLink Channel Encoder
  12. FINN- R
  13. Local and Global Shared Memory for Task Based HPC Applications on Heterogeneous Platforms
  14. Scaling Neural Network Performance through Customized Hardware Architectures on Reconfigurable Logic
  15. Accelerating big data applications using lightweight virtualization framework on enterprise cloud
  16. FPGA modeling techniques for detecting and demodulating multiple wireless protocols
  17. FIM: Performance Prediction for Parallel Computation in Iterative Data Processing Applications
  18. Secure Function Evaluation Using an FPGA Overlay Architecture
  19. A Framework for Developing Parallel Applications with high level Tasks on Heterogeneous Platforms
  20. Using High Level GPU Tasks to Explore Memory and Communications Options on Heterogeneous Platforms
  21. Performance prediction techniques for scalable large data processing in distributed MPI systems
  22. Open-Source Variable-Precision Floating-Point Library for Major Commercial FPGAs
  23. Design space exploration of GPU Accelerated cluster systems for optimal data transfer using PCIe bus
  24. Unified and lightweight tasks and conduits: A high level parallel programming framework
  25. State-Action Based Link Layer Design for IEEE 802.11b Compliant MATLAB-Based SDR
  26. High-level hardware-software co-design of an 802.11a transceiver system using Zynq SoC
  27. Cardiac MRI compressed sensing image reconstruction with a graphics processing unit
  28. High-Level System Design of IEEE 802.11b Standard-Compliant Link Layer for MATLAB-Based SDR
  29. Validity and reliability of Kinect skeleton for measuring shoulder joint angles: a feasibility study
  30. Accelerating K-Means clustering with parallel implementations and GPU computing
  31. GPU implementation of reverse coordinate conversion for proteins
  32. Leakage evaluation on power balance countermeasure against side-channel attack on FPGAs
  33. Balance power leakage to fight against side-channel analysis at gate level in FPGAs
  34. Side-channel analysis of MAC-Keccak hardware implementations
  35. Accuracy of kinect for measuring shoulder joint angles in multiple planes of motion
  36. Kernel Specialization Provides Adaptable GPU Code for Particle Image Velocimetry
  37. Behavioral Non-portability in Scientific Numeric Computing
  38. Implementing a MATLAB-Based Self-configurable Software Defined Radio Transceiver
  39. Power analysis attack on hardware implementation of MAC-Keccak on FPGAs
  40. Accelerating protein coordinate conversion using GPUs
  41. Fast reconstruction of 3D volumes from 2D CT projection data with GPUs
  42. Reducing Processing Latency with a Heterogeneous FPGA-Processor Framework
  43. Validity and reliability of kinect for measuring shoulder joint angles
  44. Make it real: Effective floating-point reasoning via exact arithmetic
  45. FPGA-based hyperspectral covariance coprocessor for size, weight, and power constrained platforms
  46. Vendor agnostic, high performance, double precision Floating Point division for FPGAs
  47. Development of a low-cost, adaptive, clinician-friendly virtual rehabilitation system
  48. Kernel Specialization for Improved Adaptability and Performance on Graphics Processing Units (GPUs)
  49. A Message from the General Chair and Program Chair
  50. Minimum energy operation for clustered island-style FPGAs
  51. Minimum Energy Analysis and Experimental Verification of a Latch-Based Subthreshold FPGA
  52. Characterization of a single-supply subthreshold FPGA
  53. Cognitive radio universal software hardware
  54. CUDA and OpenCL implementations of 3D CT reconstruction for biomedical imaging
  55. VForce: An environment for portable applications on high performance systems with accelerators
  56. CRUSH: Cognitive Radio Universal Software Hardware
  57. Heterogeneous tasks and conduits framework for rapid application portability and deployment
  58. Cognitive Radio Universal Software Hardware
  59. Incremental clustering applied to radar deinterleaving
  60. Adaptable Two-Dimension Sliding Windows on NVIDIA GPUs with Runtime Compilation
  61. An Autonomous Vector/Scalar Floating Point Coprocessor for FPGAs
  62. VFloat
  63. Efficient template matching with variable size templates in CUDA
  64. A truly two-dimensional systolic array FPGA implementation of QR decomposition
  65. Implementing a Highly Parameterized Digital PIV System on Reconfigurable Hardware
  66. Accelerating phase unwrapping and affine transformations for optical quadrature microscopy using CUDA
  67. FPGA Supercomputing Platforms, Architectures, and Techniques for Accelerating Computationally Complex Algorithms
  68. Parallel Backprojection: A Case Study in High-Performance Reconfigurable Computing
  69. Implementing phase unwrapping using Field Programmable Gate Arrays or Graphics Processing Units: A comparison
  70. Special issue: General-purpose processing using graphics processing units
  71. Efficient Communication Between the Embedded Processor and the Reconfigurable Logic on an FPGA
  72. An efficient implementation of a phase unwrapping kernel on reconfigurable hardware
  73. An FPGA Implementation of Explicit-State Model Checking
  74. An Efficient Implementation of a Phase Unwrapping Kernel on Reconfigurable Hardware
  75. Dynamo: a runtime partitioning system for FPGA-based HW/SW image processing systems
  76. K-means Clustering for Multispectral Images Using Floating-Point Divide
  77. Writing Portable Applications that Dynamically Bind at Run Time to Reconfigurable Hardware
  78. Vforce: An Extensible Framework for Reconfigurable Supercomputing
  79. Advanced Components in the Variable Precision Floating-Point Library
  80. Automatic Sliding Window Operation Optimization for FPGA-Based
  81. Enabling MPEG-2 video playback in embedded systems through improved data cache efficiency
  82. Real-Time Particle Image Velocimetry for Feedback Loops Using FPGA Implementation
  83. Field-Programmable Gate Arrays in Embedded Systems
  84. Poster reception---Improving the performance of parallel backprojection on a reconfigurable supercomputer
  85. Parallel-Beam Backprojection: An FPGA Implementation Optimized for Medical Imaging
  86. Optimizing data intensive window-based image processing on reconfigurable hardware boards
  87. Applying reconfigurable hardware to the analysis of multispectral and hyperspectral imagery
  88. Accurate Power Estimation for Sequential CMOS Circuits Using Graph-based Methods
  89. Design issues for hardware implementation of an algorithm for segmenting hyperspectral imagery
  90. Effect of data truncation in an implementation of pixel clustering on a custom computing machine
  91. HML, a novel hardware description language and its translation to VHDL
  92. A data-centric approach to high-level synthesis
  93. Spatial and color clustering on an FPGA-based computer system
  94. Rothko: a three-dimensional FPGA
  95. Division and square root: choosing the right implementation
  96. Optimizing the data cache performance of a software MPEG-2 video decoder
  97. Rothko: A three dimensional FPGA architecture, its fabrication, and design tools
  98. Area and performance tradeoffs in floating-point divide and square-root implementations
  99. An automaton model for scheduling constraints in synchronous machines
  100. Non-restoring integer square root: A case study in design by principled optimization
  101. Reasoning about pipelines with structural hazards
  102. Verifying a logic-synthesis algorithm and implementation: a case study in software verification
  103. A methodology for efficient hardware verification
  104. PBS: proven Boolean simplification
  105. Erratum to: High level synthesis and generation FPGAs with the BEDROC system
  106. High level synthesis and generating FPGAs with the BEDROC system
  107. Formally verified synthesis of combinational CMOS circuits
  108. From programs to transistors: Verifying hardware synthesis tools
  109. Reasoning about the function and timing of integrated circuits with interval temporal logic
  110. Automatic determination of signal flow through MOS transistor networks
  111. Runtime assignment of reconfigurable hardware components for image processing pipelines
  112. Run-time execution of reconfigurable hardware in a Java environment
  113. Design tradeoffs in a hardware implementation of the k-means clustering algorithm
  114. High level synthesis for designing custom computing hardware
  115. Truly rapid prototyping requires high level synthesis