hero

PL Network

196
companies
468
Jobs

Senior Engineer, SRE & Product Support

Hex Trust

Hex Trust

Product, Customer Service
Hong Kong · Sheung Wan, Hong Kong
Posted on Friday, September 6, 2024

About Hex Trust

Hex Trust is a fully-licensed and insured digital asset custodian. Led by veteran banking technologists and award-winning financial services experts, Hex Trust has built Hex Safe, a proprietary bank-grade platform that delivers solutions for digital asset protocols, foundations, financial institutions, and the Web3 ecosystem. Hex Trust has offices in Singapore, Hong Kong, Dubai, Italy, and Vietnam.

About the job

Responsible for monitoring the digital asset custody technology stack to ensure the stack is operational 24x7, stable and has sufficient capacity. Provide timely root cause analysis of platform and application incidents, involving the key resources as needed, so we can quickly restore normal operation.

Building out automated solutions for complex operational problems using industry best practice and cloud native technologies. Actively seek out innovative solutions towards operational excellence and coordinate proactively with development, operations, and the wider platform team to improve system availability, security, performance, and maintainability.

Responsibilities

  • Ensure continuous, scalable and robust operation of our production environments.
  • Collaborate closely with the development teams in a fast-paced delivery environment to foster the SRE mindset as part of the software development process.
  • Codify and rollout shared tooling and process/service to enable development teams continuously deliver new features while improving non-functional requirements such as system availability, security, performance, and maintainability.
  • Working with development and DevOps to ensure the appropriate level of component redundancy and infrastructure capacity is in place.
  • Proactively analyse events and provide ongoing recommendations to incorporate process improvements to prevent service impacting incidents.
  • Manage Level 1 through to Level 3 production support.