Chennai, Tamil Nadu, India
Information Technology
Full-Time
SourcingXPress
Overview
Company: MulticoreWare Inc
Website: Visit Website
Business Type: Small/Medium Business
Company Type: Product & Service
Business Model: B2B
Funding Stage: Bootstrapped
Industry: Information Technology Product/Services
Salary Range: ₹ 7-30 Lacs PA
Job Description
We are looking for strong C/C++ developers with a passion for performance optimization and systems programming. While prior experience with CUDA, OpenCL etc., or hardware accelerators is a plus, we welcome candidates who are eager to learn and scale up in machine learning, computer vision, and numeric library optimization for CPUs, GPUs, DSPs, and accelerators.
You Will
Elaborate And Exact JD Of The Role Is Below
We are seeking a talented engineer to implement and optimize machine learning, computer vision, and numeric libraries for target hardware architecture, including CPUs, GPUs, DSPs, and other accelerators. Your expertise will be instrumental in enabling efficient and high-performance execution of algorithms on these hardware platforms.
Key Responsibilities
Website: Visit Website
Business Type: Small/Medium Business
Company Type: Product & Service
Business Model: B2B
Funding Stage: Bootstrapped
Industry: Information Technology Product/Services
Salary Range: ₹ 7-30 Lacs PA
Job Description
We are looking for strong C/C++ developers with a passion for performance optimization and systems programming. While prior experience with CUDA, OpenCL etc., or hardware accelerators is a plus, we welcome candidates who are eager to learn and scale up in machine learning, computer vision, and numeric library optimization for CPUs, GPUs, DSPs, and accelerators.
You Will
- Develop and optimize high-performance software using C/C++ for numerical computing, machine learning, and computer vision applications.
- Learn and apply low-level optimizations, including parallelization, vectorization, and memory management, to enhance execution on hardware platforms.
- Gain hands-on experience with CUDA, OpenCL, or similar programming models as part of the role.
- 4+ years of experience in C/C++ development, with a strong grasp of data structures, algorithms, and performance optimization.
- Willingness to learn and grow in GPU programming, parallel computing, and hardware acceleration.
- Knowledge of parallel computing concepts, SIMD instructions, and memory hierarchies is a plus.
- Prior experience with CUDA, OpenCL, or similar is advantageous but not mandatory.
- Strong problem-solving skills and the ability to work independently or in a team.
Elaborate And Exact JD Of The Role Is Below
We are seeking a talented engineer to implement and optimize machine learning, computer vision, and numeric libraries for target hardware architecture, including CPUs, GPUs, DSPs, and other accelerators. Your expertise will be instrumental in enabling efficient and high-performance execution of algorithms on these hardware platforms.
Key Responsibilities
- Implement and optimize machine learning, computer vision, and numeric libraries for target hardware architectures, including CPUs, GPUs, DSPs, and other accelerators.
- Work closely with software and hardware engineers to ensure optimal performance on target platforms.
- Implement low-level optimizations, including algorithmic modifications, parallelization, vectorization, and memory access optimizations, to fully leverage the capabilities of the target hardware architectures.
- Work with customers to understand their requirements and implement libraries to meet their needs.
- Develop performance benchmarks and conduct performance analysis to ensure the optimized libraries meet the required performance targets.
- Stay current with the latest advancements in machine learning, computer vision, and high-performance computing.
- BTech/BE/MTech/ME/MS/PhD degree in CSE/IT/ECE
- > 4 years of experience working in Algorithm Development, Porting, Optimization & Testing
- Proficient in programming languages such as C/C++, CUDA, OpenCL, or other relevant languages for hardware optimization.
- Hands-on experience with hardware architectures, including CPUs, GPUs, DSPs, and accelerators, and familiarity with their programming models and optimization techniques.
- Knowledge of parallel computing, SIMD instructions, memory hierarchies, and cache optimization techniques.
- Experience with performance analysis tools and methodologies for profiling and optimization.
- Knowledge of deep learning frameworks and techniques is good to have
- Strong problem-solving skills and ability to work independently or within a team.
- A Software/Product Development Organization delivering next-generation technologies.
- Great Place to Work (GPTW) Certified, fostering a culture of trust and collaboration.
- Recognized as ET's Future Ready Organization, driving innovation responsibly.
- Shape the Future of AI: At MulticoreWare Inc., you’ll work on cutting-edge technologies like computer vision, natural language processing, and generative AI, redefining industries and advancing humanity's digital future.
- Be Part of a Visionary Team: Collaborate with some of the brightest minds in AI and software engineering, solving complex challenges and driving innovation in high-performance computing and AI-powered solutions.
- Accelerate Your Growth: Experience unparalleled career development with exposure to groundbreaking projects, advanced tools, and a culture that values continuous learning and professional excellence.
- Make an Impact Globally: Work on products and solutions that influence industries like automotive, media, and healthcare, delivering responsible AI-driven innovation worldwide.
- Innovate with Purpose: Join a team where your passion for technology meets a mission-driven approach, creating solutions that blend innovation, efficiency, and sustainability.
Similar Jobs
View All
Talk to us
Feel free to call, email, or hit us up on our social media accounts.
Email
info@antaltechjobs.in