Applications for this position are currently paused

Senior Data Engineer / Machine learning Engineer -- 資料工程師 / 機器學習工程師

Save
Job updated 21 days ago
Actively Reviewing Resumes

Job Description

介紹 / Introduction

我們在建立一個計算平台,這個平台的使用對象是研究者以及藥廠,因此這個平台的功能會包括統計相關的計算、資料處理、資料分析、資料視覺化、機器學習建模。最酷的地方是,我們的資料除了常見的structure data,還有語音、文字與影片資料。這個平台是連結virtual與現實世界的橋樑。因此,我們歡迎工程師來一起建立語音與影片的資料處理流程、擷取醫學相關特徵,並將這些工具wrap成服務,以API及視覺化報表的方式協助使用者進行他們的研究。逐步地建構這個平台。

We're building software to transform long-term patient condition management. We are collaborating with a team of elite clinicians and medical researchers from Taiwan's top medical institutions. There are big questions about the care solutions. Phase 1 is enabling and supporting our partners’ research; phase 2 will be delivering care. AI is involved. 🤖

For researchers and pharmaceutical companies, this computing platform include statistical calculations, data processing, data analysis, data visualization, and machine learning modeling. The coolest thing is that in addition to common structured data, our data also includes voice, text and video data. This platform is a bridge between the virtual and the real world. Therefore, we welcome engineers to work together to build voice and video data processing, extract medical-related features, and wrap these tools into services to assist users in their research through APIs and visual reports.

工作內容

  1. 提取聲音與影片資料的特徵,給訓練深度學習模型使用
  2. 建立資料工作流,工作流上的模組必須通過模組測試
  3. 建立統計工具模組及分析工具模組
  4. 使用docker將模組容器化:建立docker image、 使用yaml定義和執行多容器應用程式
  5. 建立、管理、使用與優化PostgreSQL 與 MongoDB 資料庫
  6. 寫工具模組的使用文件
  7. 使用gitlab進行團隊開發工作與版本管控
  8. 與臨床團隊、軟體團隊和資料科學家合作
  1. Extract the features from audio and video data where these features are useful for further analysis and deep learning model training.
  2. Build data pipeline and do unit test of all tools you built
  3. Build statistical and analysis tools
  4. Build docker image of the tools, write yaml to run the container
  5. PostgreSQL and MongoDB management
  6. Write document of tools
  7. Use gitlab for collaboration on development and version control
  8. Collaborate with clinical team, software team and data scientists

Requirements

職能要求

    1. 8年以上的軟體工程經驗
    2. 具有開發演算法經驗,並樂意學習新演算法 (3年以上)
    3. 熟悉聲音以及影片資料的訊號處理方法 (3年以上)
    4. 熟悉聲音及影片資料的清理與特徵提取方法
    5. 具有統計學背景知識
    6. 優秀的溝通能力與責任感
    7. 熟悉gitlab進行基本CI/CD、建立issue、回報錯誤、版本管控等團隊開發經驗
    8. 熟悉如何使用docker
    9. 使用 python 或 javascript 開發
    10. 熟悉 pytorch 以及 tensorflow
    11. 熟悉 PostgreSQL以及MongoDB
    12. 能夠在Azure 或 GCP 或 AWS 部屬應用

    Requirements

    1. 8+ years in software engineering
    2. Experience in developing algorithms and willing to learn new ones (3+ years)
    3. Good understanding of signal processing in audio and video data (3+ years)
    4. Experience in audio and video data cleaning and feature extraction
    5. Good statistics understanding
    6. Good communication ability and high responsibility
    7. Experience in gitlab (basic CI/CD, issue, version control)
    8. Experience in docker
    9. Experience in python or javascript
    10. Experience in pytorch and tensorflow
    11. Experience in PostgreSQL and MongoDB
    12. Experience in Azure or GCP or AWS in how to deploy the tools

    加分項目

    1. 能夠用英文進行工作上的口語與文字溝通
    2. 對ML或DL的工程開發有興趣
    3. 了解 infrastructure as code 的概念
    4. 有管理data fabric 或 data mesh 的經驗
    5. 有使用 Prometheus 或 Kibana 或 Grafana 的經驗

    Plus

    1. Confidence communicating in English
    2. Enthusiastic in ML&DL model training, evaluation and testing
    3. Understand the concept of infrastructure as code
    4. Experience in managing data fabric or data mesh
    5. Experience in Prometheus or Kibana or Grafana

    Interview process

    Technical Interview with Head of Data Science
    Interview with CTO
    Pair Coding test with another member of Data Science team
    Behavioral Interview + Culture w/ CEO or Head of Product/Ops

    View all jobs
    View all jobs
    Save
    1
    8 years of experience required
    1,500,000 ~ 2,000,000 TWD / year
    Partial Remote Work
    Personal Invitation Link
    This is your personal referral link for job invitation. You'll receive an email notification when someone applied for the position via your job link.
    Share this job
    Logo of SymptomTrace - 星坦科技股份有限公司.

    About us

    We're building software to transform long-term patient condition management. We are collaborating with a team of elite clinicians and medical researchers from Taiwan's top medical institutions. There are big questions about the care solutions. Phase 1 is enabling and supporting our partners’ research; phase 2 will be delivering care. AI is involved. 🤖


    Team

    Avatar of the user.
    Head of Data Science