HSIO Functional Validation Engineer at AMD - QATestingJobs.com
HSIO Functional Validation Engineer
AMD • George Town, Penang, Malaysia
Hybrid • Full-time
Posted Feb 9, 2026 • Apply by Feb 9, 2027
Role & seniority: Intermediate-level Network System Debug Engineer for Datacenter Graphics and Accelerated Computing (DCGPU); guide/mentor role with leadership in root-cause analysis and cross-team coordination.
Stack/tools: GPU/SOC HW-FW debugging; Linux development environment; networking (Ethernet, InfiniBand, RDMA); virtualization (KVM, Hyper-V); lab equipment (oscilloscopes, protocol analyzers, power supplies, multimeters); network prototypes and platform validation; debugging/documentation tooling.
Top 3 responsibilities
Debug/triage for a production-level quality initiative; drive root-cause resolution across IP layers and validate through stress tests, clock/power verification, and BOM/EC checks.
Define and execute platform-level validation test plans; collaborate with software/hardware teams and external partners to resolve issues.
Provide technical leadership, mentoring, and customer-facing support for network architecture questions; document debug flows and improve product processes.
Must-have skills
4–6+ years in system or SOC-level debug/triage; proven ability to resolve critical lab issues in Datacenter environments.
Strong understanding of GPU/system flow, HW/SW interaction, and platform bring-up.
Proficiency with Linux networking, Ethernet/InfiniBand designs, and RDMA; experience with lab hardware and debugging tools; excellent written/verbal communication.
Experience with network technologies in Datacenter, c
Full Description
WHAT YOU DO AT AMD CHANGES EVERYTHING
At AMD, our mission is to build great products that accelerate next-generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you’ll discover the real differentiator is our culture. We push the limits of innovation to solve the world’s most important challenges—striving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Together, we advance your career.
THE ROLE
We are seeking an engineer who will thrive in a fast-paced work environment, using effective communication, problem-solving, and prioritization skills. Individuals who are well organized, show great attention to detail, and employ critical thinking are well suited for our team. The Datacenter Graphics and Accelerated Computing (DCGPU) organization is looking for an experienced network system-level debug engineer focused on Datacenter environments. This individual will be part of a quality initiative that involves driving weekly production-level parts through specific validation, including stress testing, Technical Data Package verification (clocks, frequency, power), and BOM/EC verification in various network configurations. The individual will need to drive any issues encountered to root-cause closure and communicate with the different IP layers for resolution.
THE PERSON
This AMD (Advanced Micro Devices) team is looking for an intermediate-level person who can help guide the team, mentor upcoming developers, provide long-range strategy, and is willing to jump in to help resolve issues quickly. You will be involved in all areas that impact the team, including performance, automation, and development. The right candidate will stay informed on the latest trends and be prepared to give consultative direction to senior management. The person should be experienced in debugging complex HW/FW issues and understand the flow of a GPU through the different layers of an SOC and system. Communication is essential in working with different owners of the code stack, as is the ability to drive issues via phone calls and chat messages.
KEY RESPONSIBILITIES
A powerful desire to learn new skills and understand new features as they are added
Proven track record of working within and across groups
Effective communication skills
Responsible for exploring opportunities to improve product
Work closely with other team members to understand design architecture and to propose solutions to improve and enhance products
Debug / triage engineer for a new quality initiative
Understanding of GPU/System level HW and SW flow
Provide leadership in driving issues / bugs to root cause
Communicate / document debug flows and methods
Embedded coding for hardware components and respective drivers for network components
Assist with network prototypes and in-depth testing to validate the design
Formulate and define platform level validation test plans based on product/customer needs
Troubleshoot and resolve platform network issues
Provide customer support regarding network architectural questions, product prerequisites, and product features
Interface with networking partners and software/hardware engineers
Work with software developers on network performance enhancement
PREFERRED EXPERIENCE
Exposure to systems architecture
4-6 years of experience in System or SOC level debug and triage
Proven ability to drive resolution of critical problems within a lab or Datacenter
Relationship with external customers/partners and able to help resolve problems in their Data Center
Relationship with external customers/partners, with the ability to work through manufacturing issues/failures
Relationship with external customers/partners, with the ability to define requirements for manufacturing validation
4+ years’ working experience with network technologies including network selection and deployment in Datacenter environments
Experience with modern networking standards
Experience with mesh network routing protocols and switching protocols
Familiar with Ethernet and InfiniBand network designs and switch topologies
Linux Operating System as a development environment
Familiar with Ethernet and InfiniBand networking in Linux and Windows environments
Familiar with virtualization environments: KVM and Hyper-V
RDMA network configuration, troubleshooting
Linux kernel networking expertise
System/Platform level debug tools.
Familiar with networking environments that utilize HPC / ML/DL workloads
Hands-on experience with lab equipment such as oscilloscopes, protocol analyzers, power supplies, and multimeters
Familiar with platform/system bring-up and validation of GPU networks, both intranode and internode (networking adapters, cables, switches)
Significant experience in SoC and/or System debug of complex network issues
Develop / Document debug capabilities on a given SOC and System
Go-to-person for debugging of issues for the Production level Platform validation
Collaborate with internal teams on root causing issues, finding optimum resolutions
ACADEMIC CREDENTIALS
Bachelor’s or Master’s in Electrical Engineering, Computer Engineering, Computer Science, or a closely related field
LOCATION
Penang, Malaysia
Benefits offered are described in "AMD benefits at a glance."
AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout all stages of the recruitment and selection process.
AMD may use Artificial Intelligence to help screen, assess or select applicants for this position. AMD’s “Responsible AI Policy” is available here.
This posting is for an existing vacancy.