Let find() take format as argument to allow returning as list #10

mhabedan · 2025-10-15T13:01:50Z

Is there a specific reason why Client.find() returns a string if the ids keyword is given? I see why that is helpful when calling find from the command line, but when using the Python interface it seems unnecessary, particularly when the list entries are first joined into a string by ' '.join(find_results) only to be split back into a list in the user's script. (I have one where exactly this is the case.)

If there are no objections in principle, this PR resolves the issue by adding a format argument to Client.find(). To leave the default behaviour of find() unchanged, the default format remains str, but it can now also be set to list, set or tuple if desired.

Matching tests have been added, the readme updated and the version bumped.
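The return-type dispatch described above can be sketched roughly as follows. This is a minimal illustration, not the code in hepdata_cli/api.py: `format_ids` is a hypothetical stand-in for the tail end of Client.find(), and the ID values are placeholders. The parameter is named `format` to match the PR, even though that shadows the Python builtin inside the function.

```python
def format_ids(find_results, format=str):
    """Return the matched IDs in the requested container type.

    Hypothetical stand-in for the return logic inside Client.find();
    the function name and signature are assumptions made for this sketch.
    """
    if format is str:
        # Default: space-separated string, convenient on the command line.
        return ' '.join(str(i) for i in find_results)
    if format in (list, set, tuple):
        # Calling the type itself builds the requested container.
        return format(find_results)
    raise TypeError(
        f"Cannot return results in specified format: {format}. "
        f"Allowed formats are: {str}, {list}, {set}, {tuple}."
    )


ids = [1, 2, 3]  # placeholder values, not real HEPData IDs
print(format_ids(ids))                # 1 2 3
print(format_ids(ids, format=list))   # [1, 2, 3]
print(format_ids(ids, format=tuple))  # (1, 2, 3)
```

Passing the type object rather than a string such as "list" keeps the dispatch short, at the cost of an argument that looks slightly unusual at the call site.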


Copilot

Pull Request Overview

This PR adds a format parameter to the Client.find() method, allowing users to specify the return type (str, list, set, or tuple) when retrieving IDs. Previously, IDs were always returned as a space-separated string, requiring users to manually split them into lists. The default behavior remains unchanged (returning a string) for backward compatibility.

Key Changes:

- Added format parameter to find() method with support for str, list, set, and tuple return types
- Updated _build_urls() to accept strings in addition to lists/tuples for id_list
- Bumped version from 0.3.0 to 0.3.1
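As a rough illustration of the second change, accepting a string for id_list could be handled by checking for str first and treating everything else as an iterable of IDs. The helper name `normalize_id_list` is hypothetical and is not taken from the PR; the actual _build_urls() code may differ.

```python
def normalize_id_list(id_list):
    """Hypothetical helper: accept a space-separated string or an iterable of IDs.

    Returns a list of ID strings in either case.
    """
    if isinstance(id_list, str):
        # str.split() with no argument splits on any run of whitespace.
        return id_list.split()
    # list, tuple and set all iterate over their elements; a set has no
    # guaranteed order, so the resulting order may vary for set input.
    return [str(i) for i in id_list]
```

Testing for str explicitly, rather than ruling out every other container type, means any further iterable type works without extra branches.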

Reviewed Changes

Copilot reviewed 5 out of 5 changed files in this pull request and generated 2 comments.

| File | Description |
| --- | --- |
| hepdata_cli/api.py | Added `format` parameter to `find()` method with logic to return results in different formats; updated `_build_urls()` to handle string input; fixed typo in assertion message |
| tests/test_search.py | Updated test parameters and assertions to validate the new `format` parameter functionality |
| tests/test_download.py | Added tests for different input formats (str, list, set, tuple) and new integration test combining find() with download() |
| README.md | Updated documentation to explain the new `format` parameter and showed example usage with `format=list` |
| hepdata_cli/version.py | Bumped version to 0.3.1 |

GraemeWatt

Thanks for the PR! I agree that it is useful to add the format option without changing the default behaviour, which comes from the arxiv-cli package that originally inspired hepdata-cli. However, the arxiv-cli package has not been maintained for 7 years and is now difficult to install. In a recent commit 3e8a9df I replaced its use in Example 6 with arxiv.py, but then I had to insert an additional line id_list = id_list.split() for it to work. The new format option makes it simpler. I'd be happy to merge after you address my two comments (and the two comments from Copilot).

GraemeWatt · 2025-10-21T12:28:07Z

hepdata_cli/api.py

+            elif format in (set, tuple):
+                return format(find_results)
+            else:
+                raise TypeError(f"Cannot return results in specfied format: {format}. Allowed formats are: {str}, {list}, {set}, {tuple}.")


This line is not covered by the tests, resulting in a 0.4% decrease in test coverage for the PR. Please add a test that raises the TypeError exception.

mhabedan · 2026-01-23

Good point. Fixed in 3de6810.
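A test covering the error branch might look like the sketch below. It does not reproduce the test added in 3de6810: `format_ids` is a hypothetical stand-in for the format handling in Client.find(), so that the example runs without network access, and the test is a plain function that pytest would collect by its `test_` prefix.

```python
def format_ids(find_results, format=str):
    """Hypothetical stand-in for the format handling in Client.find()."""
    if format is str:
        return ' '.join(str(i) for i in find_results)
    if format in (list, set, tuple):
        return format(find_results)
    raise TypeError(f"Cannot return results in specified format: {format}.")


def test_find_rejects_unsupported_format():
    # dict is not one of str, list, set or tuple, so the final branch must fire.
    try:
        format_ids(["1", "2"], format=dict)
    except TypeError as exc:
        # The message should name the offending type.
        assert "dict" in str(exc)
    else:
        raise AssertionError("expected TypeError for format=dict")
```

Where pytest is available, `with pytest.raises(TypeError):` is the more idiomatic way to write the same check.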

GraemeWatt · 2025-10-21T12:33:08Z

tests/test_search.py

 from click.testing import CliRunner

-from hepdata_cli.api import Client
+from hepdata_cli.api import Client, MAX_MATCHES, MATCHES_PER_PAGE


Why import MAX_MATCHES and MATCHES_PER_PAGE here? They're not used in this file.

mhabedan · 2026-01-22

Apologies, that was left over from an earlier version of the tests. Removed now.

GraemeWatt · 2025-10-21T15:15:05Z

@codecov-ai-reviewer review

codecov-ai · 2025-10-21T15:15:54Z

hepdata_cli/api.py

+            else:
+                raise TypeError(f"Cannot return results in specfied format: {format}. Allowed formats are: {str}, {list}, {set}, {tuple}.")


Typo in error message: 'specfied' should be 'specified'. This is a user-facing error message that will appear unprofessional with the typo.


mhabedan · 2026-01-23T17:38:57Z

Sorry for the long wait. I think I have addressed all issues now.

mhabedan added 5 commits October 15, 2025 10:53

let find() take format as argumen to allow returning as list

5a719c6

add set as format

eb26509

update readme, add tuple case, extend tests to download

ca7c3a1

check whether id_list is instance of str instead of negating other possibilities

5c6b8dc

correct error message

aec4772

GraemeWatt requested a review from Copilot October 21, 2025 12:20

Copilot AI reviewed Oct 21, 2025


GraemeWatt requested changes Oct 21, 2025


codecov-ai bot reviewed Oct 21, 2025

mhabedan and others added 3 commits January 22, 2026 17:01

Add missing spaces around equality operator

9ee4b23

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

fix typo

9364df8

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

remove unused imports, test raising TypeError

3de6810
