Newly Blog

CMake

Posted on 2026-03-17 Edited on 2022-04-08 In compiler

in-source make
out-of-source make

Write Basic CMakeLists.txt

common head

1 2	PROJECT(projectname [CXX] [C] [Java]) # project name cmake_minimum_required(VERSION 2.8.12) # minimum cmake version

variable assignment

1	SET(VAR [VALUE] [CACHE TYPE DOCSTRING [FORCE]])

variable should be used by ${VAR} except for IF condition.
commonly used path variables are listed as follows:

<projectname>_BINARY_DIR = PROJECT_BINARY_DIR=CMAKE_BINARY_DIR
<projectname>_SOURCE_DIR = PROJECT__SOURCE_DIR=CMAKE_SOURCE_DIR
SET(EXECUTABLE_OUTPUT_PATH ${PROJECT_BINARY_DIR}/bin)
SET(LIBRARY_OUTPUT_PATH ${PROJECT_BINARY_DIR}/lib)

display message

1	MESSAGE([SEND_ERROR \| STATUS \| FATAL_ERROR] "message to display")

generate output binary/library

ADD_EXECUTABLE(hello ${SRC_LIST})
ADD_LIBRARY(libname [SHARED|STATIC|MODULE] [EXCLUDE_FROM_ALL] source1 source2 ... sourceN)
TARGET_LINK_LIBRARIES(target library1 <debug | optimized> library2 ...) /#target-specific

// change the name, version, and so on of the output (*e.g.*, library, binary)
SET_TARGET_PROPERTIES(target1 target2 ...
	PROPERTIES prop1 value1
	prop2 value2 ...)

Finding

search source

1 2	INCLUDE_DIRECTORIES([AFTER\|BEFORE] [SYSTEM] dir1 dir2 ...) #include files LINK_DIRECTORIES(directory1 directory2 ...)

CMAKE_INCLUDE_PATH, CMAKE_LIBRARY_PATH, CMAKE_MODULE_PATH are environment variables instead of CMake variables. when using FIND_***, CMAKE_INCLUDE_PATH, the above paths will be searched.

FIND_PATH(myHeader hello.h)
IF(myHeader)
INCLUDE_DIRECTORIES(${myHeader})
ENDIF(myHeader)

search commands

FIND_FILE(<VAR> name1 path1 path2 ...)
FIND_LIBRARY(<VAR> name1 path1 path2 ...)
FIND_PATH(<VAR> name1 path1 path2 ...)
FIND_PROGRAM(<VAR> name1 path1 path2 ...)
FIND_PACKAGE(<name> [major.minor] [QUIET] [NO_MODULE] [[REQUIRED|COMPONENTS] [componets...]]) # find \*.cmake

Compactness

include the content of other files

1	include(FILE) # load the content of FILE

hierarchical binary tree

1	ADD_SUBDIRECTORY(source_dir [binary_dir][EXCLUDE_FROM_ALL])

macro definition

MACRO(add_example name)
	ADD_EXECUTABLE(${name} ${name}.cpp)
	TARGET_LINK_LIBRARIES(${name} dlib::dlib )
ENDMACRO()

CMake Module

define FindHELLO.cmake module

FIND_PATH(HELLO_INCLUDE_DIR hello.h /usr/include/hello /usr/local/include/hello)
FIND_LIBRARY(HELLO_LIBRARY NAMES hello PATH /usr/lib/usr/local/lib)
IF (HELLO_INCLUDE_DIR AND HELLO_LIBRARY)
SET(HELLO_FOUND TRUE)
ENDIF (HELLO_INCLUDE_DIR AND HELLO_LIBRARY)
IF (HELLO_FOUND)
IF (NOT HELLO_FIND_QUIETLY)
MESSAGE(STATUS "Found Hello: ${HELLO_LIBRARY}")

Installation

cmake -DCMAKE_INSTALL_PREFIX=/usr # the default install target is /usr/local

install library and binary

INSTALL(TARGETS targets...
	[[ARCHIVE|LIBRARY|RUNTIME]
		[DESTINATION <dir>]
		[PERMISSIONS permissions...]
		[CONFIGURATIONS
	[Debug|Release|...]]
		[COMPONENT <component>]
		[OPTIONAL]
		] [...])

ARCHIVE is static library *.a; LIBRARY is dynamic library *.so; RUNTIME is executable binary

install regular file

INSTALL(FILES files... DESTINATION <dir>
	[PERMISSIONS permissions...]
	[CONFIGURATIONS [Debug|Release|...]]
	[COMPONENT <component>]
	[RENAME <name>] [OPTIONAL])

install script file (e.g., *.sh), almost the same with installing files except for permission

INSTALL(PROGRAMS files... DESTINATION <dir>
	[PERMISSIONS permissions...]
	[CONFIGURATIONS [Debug|Release|...]]
	[COMPONENT <component>]
	[RENAME <name>] [OPTIONAL])

install folders

INSTALL(DIRECTORY dirs... DESTINATION <dir>
	[FILE_PERMISSIONS permissions...]
	[DIRECTORY_PERMISSIONS permissions...]
	[USE_SOURCE_PERMISSIONS]
	[CONFIGURATIONS [Debug|Release|...]]
	[COMPONENT <component>]
	[[PATTERN <pattern> | REGEX <regex>]
	[EXCLUDE] [PERMISSIONS permissions...]] [...])

install *.cmake

1	INSTALL([[SCRIPT <file>] [CODE <code>]] [...])

Testing

1 2	ADD_TEST(mytest ${PROJECT_BINARY_DIR}/bin/main) ENABLE_TESTING()

After generating Makefile, run make test

CMake with Eclipse

Posted on 2026-03-17 Edited on 2022-04-08 In compiler

Cmake supports CDT4 and higher versions.

1	cmake -help # check the supported generator

Install CDT to Eclipse: http://www.eclipse.org/cdt/downloads.php

The eclipse build directory should be sibling directory of the source directory.

1
2
3

mkdir eclipse
cd eclipse
cmake -G "Eclipse CDT4 - Unix Makefiles" -D     CMAKE_BUILD_TYPE=Debug ../src_folder # note that CMAKE_BUILD_TYPE can be set as Debug or Release

cmake --build . --config Release is equivalent to make

cmake --build . --target install --config Release is equivalent to make install

White Balance

Posted on 2026-03-17 Edited on 2022-04-08 In paper note

White balance: White balance is the process of removing unrealistic color casts, so that objects which appear white in person eyes are shown white in the image and this is more relevant to the settings of digital cameras (auto white balance). Since human eyes are very good at judging what is white under different light sources, if an
white object is captured in a wrong or mismatched color temperature, the realism could be significantly degraded.

datasets: https://cvil.eecs.yorku.ca/projects/public_html/sRGB_WB_correction/dataset.html

Color constancy: Color constancy is the ability to perceive color of objects, invariant to the color of the light source and it’s quite related to the human visual system. Existing computational color constancy methods
address this problem by first estimating the color of the light source and then correcting the input images pixel
to pixel to make it as taken under a white light source.

datasets: http://colorconstancy.com/evaluation/datasets/index.html

Zero-Shot Semantic Segmentation

Posted on 2026-03-17 Edited on 2022-04-08 In paper note

standard semantic segmentation: [1] [2] [3]
binary segmentation: [4]

Reference

[1] Rohan Doshi, Olga Russakovsky, “zero-shot semantic segmentation”, bachelor thesis.

[2] Y. Xian, S. Choudhury, Y. He, B. Schiele and Z. Akata , “SPNet: Semantic Projection Network for Zero- and Few-Label Semantic Segmentation”, CVPR, 2019.

[3] Maxime Bucher, Tuan-Hung Vu, Matthieu Cord, Sorbonne, Patrick Pérez, “Zero-Shot Semantic Segmentation”, 2019

[4] Kato, Naoki, Toshihiko Yamasaki, and Kiyoharu Aizawa. “Zero-Shot Semantic Segmentation via Variational Mapping.” ICCV Workshops. 2019.

Zero-Shot Object Detection

Posted on 2026-03-17 Edited on 2022-04-08 In paper note

ZSL based on bounding box features
- [1]: use background bounding boxes from background classes
- [3]: classification loss with semantic clustering
End-to-end zero-shot object detection
- [2]: extend YOLO, concatenate three feature maps to predict confidence score.
- [4]: use polarity loss similar to focal loss and vocabulary to enhance word vector
- [5]: output both classification scores and semantic embeddings
Feature generation
- [6]: synthesize
  visual features for unseen classes
- [7]: semantics-preserving graph propagation modules that enhance both category and region representations

Reference

[1] Ankan Bansal, Karan Sikka, Gaurav Sharma, Rama Chellappa, Ajay Divakaran, “Zero-Shot Object Detection”, ECCV, 2018.

[2] Pengkai Zhu, Hanxiao Wang, and Venkatesh Saligrama, “Zero Shot Detection”, T-CSVT, 2019.

[3] Rahman, Shafin, Salman Khan, and Fatih Porikli. “Zero-shot object detection: Learning to simultaneously recognize and localize novel concepts.” arXiv preprint arXiv:1803.06049 (2018).

[4] Rahman, Shafin, Salman Khan, and Nick Barnes. “Polarity Loss for Zero-shot Object Detection.” arXiv preprint arXiv:1811.08982 (2018).

[5] Demirel, Berkan, Ramazan Gokberk Cinbis, and Nazli Ikizler-Cinbis. “Zero-Shot Object Detection by Hybrid Region Embedding.” arXiv preprint arXiv:1805.06157 (2018).

[6] Hayat, Nasir, et al. “Synthesizing the unseen for zero-shot object detection.” Proceedings of the Asian Conference on Computer Vision. 2020.

[7] Yan, Caixia, et al. “Semantics-preserving graph propagation for zero-shot object detection.” IEEE Transactions on Image Processing 29 (2020): 8163-8176.

Zero-Shot Learning

Posted on 2026-03-17 Edited on 2022-04-08 In paper note

Zero-shot learning focuses on the relation between visual features X, semantic embeddings A, and category labels Y. Based on the approach, existing zero-shot learning works can be roughly categorized into the following groups:

1) semantic relatedness: X->Y (semantic similarity; write classifier)

2) semantic embedding: X->A->Y (map from X to A; map from A to X; map between A and X into common space)

Based on the setting, existing zero-shot learning works can be roughly categorized into the following groups:

1) inductive ZSL (do not use unlabeled test images in the training stage) v.s. semi-supervised/transductive ZSL (use unlabeled test images in the training stage)

2) standard ZSL (test images only from unseen categories) v.s. generalized ZSL (test images from both seen and unseen categories) (novelty detection, calibrated stacking)

Ideas:

Mapping: dictionary learning, metric learning, etc
Embedding: multiple embedding [1], free embedding [1], self-defined embedding [1]
Application: video->object(attribute)->action [1], image->object(attribute)->scene
Combination: with active learning [1] [2], online learning [1]
External knowledge graph: WordNet-based [1], NELL-based [2]
Deep learning: graph neural network [1], RNN [2]
Generate synthetic exemplars for unseen categories: synthetic images [SP-AEN] or synthetic features [SE-ZSL] [GAZSL] [f-xGAN]

Critical Issues:

generalized ZSL, why first predict seen or unseen?: As claimed in [1], since we only see labeled data from seen classes, during training, the scoring functions of seen classes tend to dominate those of unseen classes, leading to biased predictions in GZSL and aggressively classifying a new data point into the label space of S because classifiers for the seen classes do not get trained on negative examples from the unseen classes.
hubness problem [1][2]: As claimed in [2], one practical effect of the ZSL domain shift is the Hubness problem. Specifically, after the domain shift, there are a small set of hub test-class prototypes that become nearest or K nearest neighbours to the majority of testing samples in the semantic space, while others are NNs of no testing instances. This results in poor accuracy and highly biased predictions with the majority of testing examples being assigned to a small minority of classes.
projection domain shift: what is the impact on the decision values?

Datasets:

small-scale datasets: CUB, AwA, SUN, aPY, Dogs, FLO
large-scale dataset: ImageNet

Survey and Resource:

Other applications:

zero-shot object detection
zero-shot figure-ground segmentation [1]
zero-shot semantic segmentation
zero-shot retrieval
zero-shot domain adaptation

Weakly-supervised Segmentation

Posted on 2026-03-17 Edited on 2023-03-16 In paper note

Using web data for semantic segmentation:
- [1]: crawl web images with white background and generate composite images to initialize segmentation network
- [2]: train a segmentation network using web data to obtain rough segmentation mask
image-level semantic/instance segmentation: [9] [10] [11]
box-level semantic/instance segmentation: [3] [4] [5] [6]
scribble/point-level semantic segmentaiton: [7] [8] [12] [13] [14] [15] [16] [17] [18] [19]

Reference

[1] Bin Jin, Maria V. Ortiz Segovia, Sabine Süsstrunk:
Webly Supervised Semantic Segmentation. CVPR 2017.

[2] Tong Shen, Guosheng Lin, Chunhua Shen, Ian D. Reid:
Bootstrapping the Performance of Webly Supervised Semantic Segmentation. CVPR 2018.

[3] Dai, Jifeng, Kaiming He, and Jian Sun. “Boxsup: Exploiting bounding boxes to supervise convolutional networks for semantic segmentation.” ICCV, 2015.

[4] Khoreva, Anna, et al. “Simple does it: Weakly supervised instance and semantic segmentation.” CVPR, 2017.

[5] Ahn, Jiwoon, Sunghyun Cho, and Suha Kwak. “Weakly Supervised Learning of Instance Segmentation with Inter-pixel Relations.” CVPR, 2019.

[6] Hsu, Cheng-Chun, et al. “Weakly Supervised Instance Segmentation using the Bounding Box Tightness Prior.” NeurIPS. 2019.

[7] Lin, Di, et al. “Scribblesup: Scribble-supervised convolutional networks for semantic segmentation.” CVPR, 2016.

[8] Bearman, Amy, et al. “What’s the point: Semantic segmentation with point supervision.” ECCV, 2016.

[9] Zhu, Yi, et al. “Learning instance activation maps for weakly supervised instance segmentation.” CVPR, 2019.

[10] Wang, Xiang, et al. “Weakly-Supervised Semantic Segmentation by Iterative Affinity Learning.” International Journal of Computer Vision (2020): 1-14.

[11] Jo, Sanhyun, and In-Jae Yu. “Puzzle-CAM: Improved localization via matching partial and full features.” arXiv preprint arXiv:2101.11253 (2021).

[12] Tang, Meng, et al. “Normalized cut loss for weakly-supervised cnn segmentation.” CVPR, 2018.

[13] Tang, Meng, et al. “On regularized losses for weakly-supervised cnn segmentation.” ECCV, 2018.

[14] Marin, Dmitrii, et al. “Beyond gradient descent for regularized segmentation losses.” CVPR, 2019.

[15] Wang, Bin, et al. “Boundary perception guidance: A scribble-supervised semantic segmentation approach.” IJCAI, 2019.

[16] Pan, Zhiyi, et al. “Scribble-supervised semantic segmentation by uncertainty reduction on neural representation and self-supervision on neural eigenspace.” ICCV, 2021.

[17] Xu, Jingshan, et al. “Scribble-supervised semantic segmentation inference.” ICCV, 2021.

[18] Chen, Hongjun, et al. “Seminar learning for click-level weakly supervised semantic segmentation.” Proceedings of the IEEE/CVF International Conference on Computer Vision. 2021.

[19] Liang, Zhiyuan, et al. “Tree energy loss: Towards sparsely annotated semantic segmentation.” Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2022.

Weakly-supervised Object Detection

Posted on 2026-03-17 Edited on 2022-04-08 In paper note

webly supervised object detection [1]
use a few bounding box annotations and a large number of image label annotations [2]

Reference

[1] Exploiting Web Images for Weakly Supervised Object Detection. IEEE Trans. Multimedia 21(5): 1135-1146 (2019)

[2] DLWL: Improving Detection for Lowshot classes with Weakly Labelled data, Vignesh Ramanathan, Rui Wang, Dhruv Mahajan, CVPR2020

Weakly-supervised Localization

Posted on 2026-03-17 Edited on 2022-04-08 In paper note

Closely related to weakly-supervised segmentation.

attention based: [1] [2]

Reference

[1] Zhang, Xiaolin, et al. “Adversarial complementary learning for weakly supervised object localization.” CVPR, 2018.

[2] Zhang, Xiaolin, et al. “Self-produced guidance for weakly-supervised object localization.” ECCV, 2018.

Weakly-supervised Classification

Posted on 2026-03-17 Edited on 2022-04-08 In paper note

Two problems:

Label noise: label flip noise (belong to other training categories) and outlier noise (does not belong to any training category).
Domain shift: domain distribution mismatch between web data and consumer data.

Solutions:

label flip layer: [1] [2] [3]
multi-instance learning: [4] (pixel-level attention) [5] [6] [19](image-level attention)
reweight training samples: [7] [8] [9]
curriculumn learning: [10] [11]
bootstrapping: [12]
negative learning: [18]
Cyclical Training: [20]

Use auxiliary clean data:

active learning (select training samples to annotate): [13]
reinforcement learning (learn labeling policies): [14]
analogous to semi-supervised learning
- partial data with both noisy labels and clean labels as well as partial data with only noisy labels [15] [3] [7]
- partial data with noisy labels and partial data with clean labels [16] [17]

Datasets:

There are two types of label noise: synthetic label noise and web label noise.

large-scale web datasets: webvision v1, webvision v2
fine-grained web datasets: clothing, car, Stanford Dogs, Food101N, MIT indoor67, skin disease-198
synthetic noisy datasets via label flipping: CIFAR-10/100

Surveys:

A Survey of Label-noise Representation Learning: Past, Present and Future

Reference

[1] Chen, Xinlei, and Abhinav Gupta. “Webly supervised learning of convolutional networks.” ICCV, 2015.

[2] Sukhbaatar, Sainbayar, et al. “Training convolutional networks with noisy labels.” arXiv preprint arXiv:1406.2080 (2014).

[3] Xiao, Tong, et al. “Learning from massive noisy labeled data for image classification.” CVPR, 2015.

[4] Zhuang, Bohan, et al. “Attend in groups: a weakly-supervised deep learning framework for learning from web data.” CVPR, 2017.

[5] Wu, Jiajun, et al. “Deep multiple instance learning for image classification and auto-annotation.” CVPR, 2015.

[6] Ilse, Maximilian, Jakub M. Tomczak, and Max Welling. “Attention-based deep multiple instance learning.” arXiv preprint arXiv:1802.04712 (2018).

[7] Lee, Kuang-Huei, et al. “Cleannet: Transfer learning for scalable image classifier training with label noise.” CVPR, 2018.

[8] Liu, Tongliang, and Dacheng Tao. “Classification with noisy labels by importance reweighting.” T-PAMI, 2015.

[9] Misra, Ishan, et al. “Seeing through the human reporting bias: Visual classifiers from noisy human-centric labels.” CVPR, 2016.

[10] Guo, Sheng, et al. “Curriculumnet: Weakly supervised learning from large-scale web images.” ECCV, 2018.

[11] Jiang, Lu, et al. “Mentornet: Learning data-driven curriculum for very deep neural networks on corrupted labels.” arXiv preprint arXiv:1712.05055 (2017).

[12] Reed, Scott, et al. “Training deep neural networks on noisy labels with bootstrapping.” arXiv preprint arXiv:1412.6596 (2014).

[13] Krause, Jonathan, et al. “The unreasonable effectiveness of noisy data for fine-grained recognition.” ECCV, 2016.

[14] Yeung, Serena, et al. “Learning to learn from noisy web videos.” CVPR, 2017.

[15] Veit, Andreas, et al. “Learning from noisy large-scale datasets with minimal supervision.” CVPR, 2017.

[16] Xu, Zhe, et al. “Webly-supervised fine-grained visual categorization via deep domain adaptation.” T-PAMI, 2016.

[17] Li, Yuncheng, et al. “Learning from noisy labels with distillation.” ICCV, 2017.

[18] Kim, Youngdong, et al. “Nlnl: Negative learning for noisy labels.” ICCV, 2019.

[19] “MetaCleaner: Learning to Hallucinate Clean Representations for Noisy-Labeled Visual Recognition”, CVPR, 2019.

[20] Huang, Jinchi, et al. “O2u-net: A simple noisy label detection approach for deep neural networks.” ICCV, 2019.