AT&T Principal-SRE in Dallas, Texas
About the Company
At AT&T, we’re connecting the world through the latest tech, top-of-the-line communications and the best in entertainment. Our groundbreaking digital solutions provide intuitive and integrated experiences for millions of customers across online, retail and care channels. Join our mission to deliver compelling communication and entertainment experiences to customers around the world as we continue to evolve as a technology-powered, human-centered organization. As part of our team, you’ll transform the way we deliver a seamless customer experience with digital at the center of all you do. In our world, digital is much larger than just an eCommerce channel, we are transforming all channels to digitally perform as one team to create a better customer experience. As we move through 2021, the digital transformation will revolutionize the digital space and you can build a career that will propel your future.
About the Team
The mission of our Digital Operations team is to operate a fault resilient, customer-centered, proactive DevOps team. The team is responsible for supporting systems that deliver AT&T’s customer experience, across multiple internet-facing eCommerce applications, databases, platforms and technology stacks. Our customer-journey centric Ops team is made up of Ops Engineers as well as Site Reliability Engineers (SREs) who are focused on ensuring a highly available, resilient, performant and secure customer experience.
About the Job
This position is responsible for SRE initiative within the Digital Operations organization. This is a strategically important team that is responsible for SRE roadmap. In this role you will lead the site reliability function. In addition you will work in an agile manner side by side with product and technology teams ensuring we are delivering a performant and resilient experience to our end customers. This role is suppose to handle complex system integration issues in our SPT Sales organization with finding out impact, work across multiple teams for troubleshooting and finding gaps/ opportunities to make the system more resilient.
Responsibilities and Day-to-Day View
Provide leadership, strategic direction and oversight over the following functions:
- Prod/ Non Prod infrastructure management.
-End to End Performance Testing
-Production Load Testing
-Chaos Testing & Engineering
Work closely with Product, Infrastructure and Technology teams to resolve performance and resiliency issues
Provide executive and leadership readouts
Champion and drive Site Reliability Engineering (SRE) practices
A Bachelor's degree in Computer Science, Information Systems, or related field from an accredited College or University
7(+) years of software development experience
5(+) years of experience in Business analysis
2(+) years of experience with Site Reliability Engineering and operations for internet/eCommerce applications.
Expertise with analyzing code and systems in a multi-tiered architecture
Experience with Public Cloud
Experience with Splunk, EFK, Dynatrace, Quantum Metrics
Solutions oriented with proven success in a fast-paced environment
Strong communication & leadership skills
AT&T is leading the way to the future – for customers, businesses and the industry. We're developing new technologies to make it easier for our customers to stay connected to their world. Together, we’ve built a premier integrated communications and entertainment company and an amazing place to work and grow. Team up with industry innovators every time you walk into work, creating the world you always imagined. Ready to #transformdigital with us? Apply now!
Bachelor's degree in Computer Science, Telecommunications, Electrical Engineering or related field
8-10 years related technical architect experience
Proficient in engineering cost estimates and economic analyses and models
Knowledge of wireless technologies standards and protocols (3GPP, Wi-Fi, WiMax, antennas, amplifiers, base stations, propagation, interference, spectrum)
Proficient in voice, video, and app technologies/protocols (circuit, VoIP, SIP, IMS, AIN, Camel, etc.)
Proficient in network and system architecture (subsystems, interfaces, hw/sw dependencies).
Proficient in message and conference systems and networks (multiple media, notification, presence, unified communications, video).
Knowledge in Network Management, Tools and Protocols (Configuration, IP network Address Management, Perf, Mgmt).
Understands Virtualization, Storage and Content Delivery Networks (Cloud, CDM, Grid, SAN).
Proficient in specifying and evaluating architecture requirements for RFXs.
Enterprise wide deployment planning and support for mission critical applications for major releases, both business and infrastructure related
Develops technical documentation on applications and systems
Ability to work with technical and business-oriented teams
AT&T will consider for employment qualified applicants in a manner consistent with the requirements of federal, state and local laws
We expect employees to be honest, trustworthy, and operate with integrity. Discrimination and all unlawful harassment (including sexual harassment) in employment is not tolerated. We encourage success based on our individual merits and abilities without regard to race, color, religion, national origin, gender, sexual orientation, gender identity, age, disability, marital status, citizenship status, military status, protected veteran status or employment status.