Community Academy Cognite Lab

Entity Matching with Python SDK

Course 3 of 7 in Data Engineer Basics - Transform and Contextualize

Learn about matching entities using Python SDK, different entity matching models, and more

rate limit

Code not recognized.

About this course

In this course, you will see a demonstration of how to perform entity matching using Cognite's Python SDK, explore the available capabilities, and explain the parameter combinations suitable for the process. Additionally, you will gain a brief understanding of what happens in the backend.

This course is for data engineers looking to up their knowledge and skills in entity matching.

NOTE: This course is for the asset-centric data model. A new course for data modelling will be available in 2026.

At the end of the course, you will be able to:

  • Understand why you should use the Python SDK for entity matching
  • Describe when and how to use supervised and unsupervised entity matching models
  • Describe different feature types and how to use them

It's recommended to have completed Cognite Data Fusion Fundamentals and Entity Matching: The Concept before taking this course, so you have a basic understanding of the CDF data model and match entities in CDF.


This course was developed by Cognite Academy, with subject matter experts:

Torbjørn Opheim | Software Engineer

Sofie Haug | Academy Engineer 

 

Curriculum30 min

  • Welcome
  • How to succeed with this course
  • Match entities using Python SDK
  • Why Python SDK for entity matching
  • Check your knowledge
  • Access and setup
  • Notebook setup
  • Supervised entity matching with Python SDK
  • Demonstration on supervised entity matching
  • Check your knowledge
  • Unsupervised entity matching with Python SDK
  • Demonstration on performing unsupervised entity matching
  • Check your knowledge
  • End of course
  • Tips and tricks
  • Key takeaways
  • Share your feedback

About this course

In this course, you will see a demonstration of how to perform entity matching using Cognite's Python SDK, explore the available capabilities, and explain the parameter combinations suitable for the process. Additionally, you will gain a brief understanding of what happens in the backend.

This course is for data engineers looking to up their knowledge and skills in entity matching.

NOTE: This course is for the asset-centric data model. A new course for data modelling will be available in 2026.

At the end of the course, you will be able to:

  • Understand why you should use the Python SDK for entity matching
  • Describe when and how to use supervised and unsupervised entity matching models
  • Describe different feature types and how to use them

It's recommended to have completed Cognite Data Fusion Fundamentals and Entity Matching: The Concept before taking this course, so you have a basic understanding of the CDF data model and match entities in CDF.


This course was developed by Cognite Academy, with subject matter experts:

Torbjørn Opheim | Software Engineer

Sofie Haug | Academy Engineer 

 

Curriculum30 min

  • Welcome
  • How to succeed with this course
  • Match entities using Python SDK
  • Why Python SDK for entity matching
  • Check your knowledge
  • Access and setup
  • Notebook setup
  • Supervised entity matching with Python SDK
  • Demonstration on supervised entity matching
  • Check your knowledge
  • Unsupervised entity matching with Python SDK
  • Demonstration on performing unsupervised entity matching
  • Check your knowledge
  • End of course
  • Tips and tricks
  • Key takeaways
  • Share your feedback