Home

"Success in creating AI would be the biggest event in human history." (Stephen Hawking)

I am a Machine Learning researcher with a primary focus on deep learning techniques for 3D perception, including automating 3D reconstruction, localization and mapping. My research interests revolve around developing innovative solutions using artificial neural networks for 3D applications. Over the course of my career, I have actively contributed to the design and development of various neural solutions. Some have been listed below.

Projects

Planar Gaussian Splatting (WACV 2024)

Planar Gaussian Splatting (PGS) introduces a novel neural rendering approach to learn the 3D geometry and parse the 3D planes of a scene, directly from multiple RGB images. The PGS leverages Gaussian primitives to model the scene and employ a hierarchical Gaussian mixture approach to group them. Similar Gaussians are progressively merged probabilistically in the tree-structured Gaussian mixtures to identify distinct 3D plane instances and form the overall 3D scene geometry.

The PGS achieves state-of-the-art performance in 3D planar reconstruction without requiring either 3D plane labels or depth supervision. In contrast to existing supervised methods that have limited generalizability and struggle under domain shift, PGS maintains its performance across datasets thanks to its neural rendering and scene-specific optimization mechanism.

Neural Mesh Fusion: UNSUPERVISED 3D PLANAR SURFACE UNDERSTANDING (IEEE ICIP 2024)

Neural Mesh Fusion (NMF) is a method for joint optimization of polygon meshes, estimated from multi-view image observations and unsupervised 3D planar-surface parsing of the scene. In contrast to implicit neural representations, NMF directly learns to deform surface triangle mesh and generate an embedding for unsupervised 3D planar segmentation through gradient-based optimization directly on the surface mesh. The conducted experiments show that NMF obtains competitive results compared to state-of-the-art multi-view planar reconstruction, while not requiring any ground-truth 3D or planar supervision.

Arxiv version (link)

VaLID: Variable-Length Input Diffusion for Novel View Synthesis (WACV 2024)

VaLID is a deep generative AI model. It performs novel-view image synthesis by processing either a single input image or variable-length multi-view images. This highly adaptive property of VaLID enables various applications in 3D reconstruction and novel view synthesis.

Arxiv version (link)

Deep Learning Frameworks for Weakly-supervised Indoor Wi-Fi Positioning (2020-2021)

Deep learning solutions for the large-scale indoor localization of people and robot agents via Wi-Fi signal with accessing only to a few room/zone level annotations.

NeurIPS demo, 2021 (link)

Video on youtube (link)

Corresponding manuscripts:

Modality-agnostic topology aware localization (link)

Farhad G. Zanjani, Ilia Karmanov, Hanno Achermann, Daniel Dijkman, Simone Merlin, Max Welling, Fatih Porikli.

NeurIPS, 2021.

WiCluster: Passive Indoor 2D/3D Positioning using WiFi without Precise Labels (link)

Ilia Karmanov, Farhad G. Zanjani, Simone Merlin, Ishaque Kadampot, Daniel Dijkman

IEEE Globecom, 2021.

Deep Learning Frameworks for Weakly-Supervised Indoor Localization (link)

Farhad G. Zanjani, Ilia Karmanov et al., NeurIPS Competitions and Demonstrations Track, 2021.

Mask-MCNet: Teeth-Instance Segmentation in 3D Point Cloud (2019)

Mask-MCNet is one of the first deep learning-based research for automating teeth segmentation in 3D point cloud of intra-oral scan (IOS) data. This model is able to process the IOS in its original spatial resolution without down-sampling to maintain the spatial precision. This work has been funded by Promaton Ltd.

The application of Mask-MCNet holds significant potential in assisting dentists in orthodontics and implantology. By leveraging this AI-enabled technology, dental professionals can benefit from automated teeth instance segmentation, streamlining their workflow and enhancing accuracy in diagnosis and treatment planning.

Corresponding manuscripts:

Mask-MCNet: Tooth instance segmentation in 3D point clouds of intra-oral scans. Neurocomputing Journal, 2021 (link).

Mask-MCNet: Instance Segmentation in 3D Point Cloud of Intra-oral Scans; Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2019.
Deep learning approach to semantic segmentation in 3D point cloud intra-oral scans of teeth, Medical Imaging with Deep Learning (MIDL), 2019.

DCGMM: Deep Convolutional GMM for Stained-Color Normalization (2018)

Deep Convolutional Gaussian Mixture Modeling (DCGMM) presents an advanced approach to address the challenge of stained-color normalization in histopathological slides. The primary objective is to enhance the performance of subsequent automated image analysis methods used for diagnostic purposes.

DCGMM offers a high-fidelity solution for stained-color normalization. This process ensures that the color appearance of histopathological slides is accurately adjusted, enabling more reliable and consistent image analysis results.

Corresponding manuscripts:

Deep convolutional gaussian mixture model for stain-color normalization or histopathological images; Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2018.

Histopathology stain-color normalization using deep generative models,Medical Imaging with Deep Learning (MIDL), 2018.

Stain normalization of histopathology images using generative adversarial networks; IEEE Int. Symposium on Biomedical Imaging (ISBI), 2018.

Needle detection and localization in 3D Ultrasound Volumes (2017-2018)

Ultrasound-guided interventions have emerged as a valuable non-invasive medical imaging technique, offering enhanced patient safety and improved health outcomes. However, the precise positioning of needles and transducers during these procedures poses significant challenges. To address this, automated localization of the needle in 3D ultrasound (US) volumes has emerged as a promising solution, overcoming the limitations of 2D approaches and facilitating accurate needle guidance.

In collaboration with Philips Healthcare, we introduced a novel approach utilizing a multiplane convolutional neural network as an efficient and accurate neural architecture for the detection and localization of needles in 3D US volumes. Our developed method demonstrated remarkable performance and efficacy, leading to its recognition with the Best Paper Award from the Medical Image Computing and Computer Assisted Intervention Society (MICCAI) in 2018.

Corresponding manuscripts:

Robust and semantic needle detection in 3D ultrasound using orthogonal-plane convolutional neural networks;International Journal of Computer Assisted Radiology and Surgery, 2018.

Improving needle detection in 3D ultrasound using orthogonal-plane convolutional networks;Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2017.

Coherent needle detection in ultrasound volumes using 3D conditional random fields;Proc.or SPIE Medical Imaging, 2018.

Localization of partially visible needles in 3D ultrasound using dilated CNNs; IEEE International Ultrasonics Symposium (IUS), 2019

© Alle rechten voorbehouden Bedrijfsnaam