[WIP] Remove Gaussian blur code alternatives that are exotic or didn't work very well by griwodz · Pull Request #159 · alicevision/popsift

griwodz · 2024-08-06T08:50:17Z

Description

PopSift had a wide variety of downscaling and Gaussian filtering modes. These were written to increase the potential parallelism of PopSift, explore the trade-off between sequential operation and wider Gaussian filters, the impact of various filter width on quality, etc.

In practice, it seems that users stay with the default. The less successful downscaling/filtering options have therefore been removed.

Two options have been retained from the original PopSift code. Both implement the classical sequence that starts by scaling the input image and performing Gaussian filtering incrementally through the levels of an octave, and downscaling the following octave from the 3rd last level of the previous octave. These are the non-interpolating and the interpolating approaches.

The classical "non-interpolating" approach

It takes the neighbouring pixels and multiplies them with the weights from the pre-computed Gaussian filter. The width of this filter varies for each level to ensure a regular scale space.

The alternative "interpolating" approach

It starts with the same Gaussian filters, although their width is rounded up to a multiple of 2. The weight of two neighbouring cells g[n] and g[n+1] is then reformulated as a relative weight between these two cells, ie.:
g[n]I + g[n+1]J = abI + (1-a)bJ = b*(a*I + (1-a)J) where b=g[n]+g[n+1] and a=g[n]/(g[n]+g[n+1])

The appropriate values for b can then be read from an array, while the neighboring pixels I and J can be read with linear interpolation with a weight of A through the texture engine. This means that the term a*I + (1-a)J is handled by the texture hardware, leaving only one multiplication per 2 pixels.

Features list

This PR does not add any features. It removes features.
Faster and less accurate downscaling and Gaussian filtering modes have been removed.
Enums for downscaling modes have been changed, "OpenCV mode" has been removed, "PopSift mode" and "VLFeat mode" have been renamed to reflect their actual functions.

Implementation remarks

Removed fixed scaling code.
Removed code to downscale everything directly from input image.
Removed functions to interpolate from first image plane.
Removed specialized version to create very first level from input image.
Removed downscaling by interpolation, which could not be called with any parameter.
Restructured the calling code for pyramid building to make sure that host code that starts the CUDA kernels is located in the same code file, making the CUDA kernels static. This prevents the overhead for linkable CUDA code.
Simplified the solution with absolute sources. Returned to a solution without shuffle and identical code structure for horizontal and vertical Gaussian filtering.
Simplified and unified code for absolute source interpolated Gaussian filtering.
Renamed extrema refinement modes to have more intuitive names. They are no longer tied to PopSift vs VLFeat. (except that the command line parameters of the test code retains the old terms so far)

Handled in other PRs

Remove Gauss filter tables for direct downscaling using absolute tables (done in PR Remove direct downscaling from input image to top level of every octave #178)
Removed the config param ScalingMode (always use default) (done in PR Remove direct downscaling from input image to top level of every octave #178)
Use horiz_from_input_image exclusively for octave 0. Direct downscaling is now only used for the input image. Note that initial blur is assumed for every input image, even when it is later interpreted as initially unblurred. That does make a difference, but is apparently recommended (done in PR Remove direct downscaling from input image to top level of every octave #178)
Removed the narrower Gauss filter width that was called "OpenCV mode" (done in PR [sift] removed support for the old option for Gaussian filter width computation opencv #179)
Removed deprecated scaling mode "OpenCV". OpenCV was buggy when this code was written. It has improved since then. (done in PR [sift] Remove unreachable code #180)

griwodz · 2025-01-14T13:28:16Z

PR #162 has been merged because latest CUDA for Linux can no longer run without it.

work very well. Remove the config param ScalingMode (always use default). Remove fixed scaling code. Remove code to downscale everything directly from input image. Remove the narrower gauss filter width called "OpenCV mode". Remove functions to interpolate from first image plane. Remove specialized version to create very first level from input image. Remove Gauss filter tables for direct downscaling using absolute tables. Removed deprecated scaling mode "OpenCV". OpenCV was buggy when this code was written. It has improved since then. Also downscaling by interpolation, which could not be called with any parameter, is removed. Restructure the calling code for the last 2 pyramid building functions Move host code for normalized source kernel into kernel's file. Normalized source mode is only used for the input image. It uses the normalization feature of CUDA textures to scale the input image while creating the first octave. Simplify the solution with absolute sources. Return to a solution without shuffle and identical code structure for horizontal and vertical Gaussian filtering. Host functions to call Gaussian filtering from point textures moved in kernels' code file. Host functions to call Gaussian filtering from interpolated textures moved in kernels' code file. Simplified and unified code for absolute source interpolated Gaussian filtering. Use horiz_from_input_image exclusively for octave 0. Direct downscaling is not only use for the input image. Note that initial blur is assumed for every input image, even when it is later interpreted as initially unblurred. That does make a difference, but is apparently recommended. Extrema refinement modes have more intuitive names and are no longer tied to PopSift vs VLFeat. (except that the command line parameters of the test code retains the old terms so far)

griwodz added in progress cuda issues related to cuda versions labels Aug 6, 2024

griwodz changed the title ~~Remove Gaussian blur code alternatives that are never used or didn't work very well.~~ [WIP] Remove Gaussian blur code alternatives that are exotic or didn't work very well Aug 6, 2024

griwodz self-assigned this Aug 6, 2024

griwodz marked this pull request as draft August 12, 2024 10:57

griwodz force-pushed the dev/prune-pyramid-code branch from c94da94 to 9a20c15 Compare August 15, 2024 07:28

griwodz force-pushed the dev/prune-pyramid-code branch from 33ca53d to b192cdb Compare January 14, 2025 14:28

griwodz force-pushed the dev/prune-pyramid-code branch from b192cdb to 5768dd1 Compare August 29, 2025 07:31

Carsten Griwodz added 2 commits October 21, 2025 07:39

fixing a fatal indexing bug in the relative kernel

e5deef0

griwodz force-pushed the dev/prune-pyramid-code branch from 5768dd1 to e5deef0 Compare October 21, 2025 05:39

griwodz mentioned this pull request Oct 21, 2025

Remove direct downscaling from input image to top level of every octave #178

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[WIP] Remove Gaussian blur code alternatives that are exotic or didn't work very well#159

[WIP] Remove Gaussian blur code alternatives that are exotic or didn't work very well#159
griwodz wants to merge 2 commits intodevelopfrom
dev/prune-pyramid-code

griwodz commented Aug 6, 2024 •

edited

Loading

Uh oh!

griwodz commented Jan 14, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

griwodz commented Aug 6, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

The classical "non-interpolating" approach

The alternative "interpolating" approach

Features list

Implementation remarks

Handled in other PRs

Uh oh!

griwodz commented Jan 14, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

griwodz commented Aug 6, 2024 •

edited

Loading

griwodz commented Jan 14, 2025 •

edited

Loading