Data Engineer - Big Data

  • Location
    San Jose, California, US
  • Area of Interest
    Engineer - Software
  • Job Type
  • Technology Interest
    Big Data, Analytics
  • Job Id

Who You’ll Work With

Digitization, data science, and automation are transforming every business. The Digital Experience and Analytics organization is playing a key role in transforming the sales, partner, and customer experience across Cisco. As part of the Customer Success and Sales function, this group is delivering on a multi-year vision that digitizes the way Cisco engages with customers and uses data science to intelligently drive action in an increasingly automated manner. This new organization is a catalyst for change that impacts all parts of Cisco.

The Customer Success data science team is an incubation hub for a range of initiatives that are changing the sales, partner, and customer experience. The team has been recognized for innovation both internally and through external industry awards. We’re looking for someone who thrives in a fast-paced environment in which they have significant responsibility and autonomy for delivering results and who gets excited about the prospect of significant learning and growth.

What You’ll Do
As a member of this dynamic and fast-paced team, you will prepare an easily accessible, high-performance “big data” environment, with a focus on designing, building, and integrating (including ETL) large volumes of data. You will manage big data for data science activities to drive decision making across the organization.

Specific responsibilities include:

·     Work closely with data scientists and analysts to design and maintain scalable data models and pipelines

·     Engage with business stakeholders and data scientists to understand and analyze business problems

·     Build data solutions using available big data frameworks and technologies

·     Acquire data from various data sources (cloud, databases, Hadoop, sensors, web logs, and social media)

·     Analyze huge sets of data in Hadoop (both structured and unstructured)

·     Perform data discovery and integration, and identify and fix data quality issues

·     Define automated processes for tracking data quality and consistency

·     Perform impact assessments and deep-dive analysis to ensure data integrity, usability, and completeness

Who You Are

Minimum Qualifications

·     Bachelor’s degree or equivalent, plus 5+ years of experience in building data solutions in large scale distributed systems.

·     Strong data analysis skills: ability to identify, analyze, and integrate various large, complex data sources (internal and external marketplace data providers) into a readily consumable data product.

·     Ability to communicate data requirements clearly to broad audiences

·     Strong experience in using open source frameworks

·     Familiarity with data science model development and production deployment 

Desired Skills

·     Highly curious, with a keen eye for data and a results-oriented attitude.

·     Requires minimal supervision; able to work independently with cross-functional teams.

·     A proven history of building big data solutions at the scale of terabytes or petabytes of data.

·     A passion for working with huge data sets.

·     Solid experience with big data frameworks.

·     Extensive programming experience in Java, Python or Scala – we’re looking for a coding geek!

Why Cisco
We connect everything: people, processes, data, and things. We innovate everywhere, taking bold risks to shape the technologies that give us smart cities, connected cars, and handheld hospitals. And we do it in style with unique personalities who aren’t afraid to change the way the world works, lives, plays, and learns. We are thought leaders, tech geeks, pop culture aficionados, and we even have a few purple-haired rock stars. We celebrate the creativity and diversity that fuel our innovation. We are dreamers and we are doers. We Are Cisco.