benchmark and optimize on Arm64 #166

@drossetti

Description

  • Run copybw and copylat on an Arm64 system with a directly attached GPU.
  • If the results warrant it, add optimized copy functions, e.g. using Neon intrinsics.
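
As a rough illustration of the second item, here is a minimal sketch of what a Neon-based copy routine could look like. The name `copy_neon_sketch` is hypothetical and not part of any existing API; the 64-byte unrolling is an arbitrary starting point rather than a measured optimum. Alignment and memory-ordering requirements for mapped GPU memory are not handled here and would need to be checked against the copybw and copylat results before anything like this is adopted. On non-Arm builds the code falls back to plain `memcpy`.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

#if defined(__ARM_NEON) || defined(__ARM_NEON__)
#include <arm_neon.h>
#define HAVE_NEON 1
#else
#define HAVE_NEON 0
#endif

/* Hypothetical helper (an assumption for illustration only):
 * copies n bytes from src to dst, using 128-bit Neon loads and
 * stores when available. Buffers must not overlap. */
void copy_neon_sketch(void *dst, const void *src, size_t n)
{
    assert(dst != NULL && src != NULL);

    uint8_t *d = (uint8_t *)dst;
    const uint8_t *s = (const uint8_t *)src;

#if HAVE_NEON
    /* Main loop: four 16-byte vectors (64 bytes) per iteration. */
    while (n >= 64) {
        uint8x16_t v0 = vld1q_u8(s);
        uint8x16_t v1 = vld1q_u8(s + 16);
        uint8x16_t v2 = vld1q_u8(s + 32);
        uint8x16_t v3 = vld1q_u8(s + 48);
        vst1q_u8(d, v0);
        vst1q_u8(d + 16, v1);
        vst1q_u8(d + 32, v2);
        vst1q_u8(d + 48, v3);
        s += 64;
        d += 64;
        n -= 64;
    }

    /* Remaining whole 16-byte vectors. */
    while (n >= 16) {
        vst1q_u8(d, vld1q_u8(s));
        s += 16;
        d += 16;
        n -= 16;
    }
#endif

    /* Tail bytes, or the whole buffer when Neon is unavailable. */
    if (n > 0) {
        memcpy(d, s, n);
    }
}
```

A routine like this would only be worth keeping if it beats the existing copy path in the benchmarks from the first item, so the measurements should come first.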
