CGamesPlay/OpenAI Token Counting.ipynb

Last active October 27, 2024 00:38

Star (22) You must be signed in to star a gist
Fork (6) You must be signed in to fork a gist

Select an option

Learn more about clone URLs
Clone this repository at <script src="https://gist.github.com/CGamesPlay/dd4f108f27e2eec145eedf5c717318f5.js"></script>
Save CGamesPlay/dd4f108f27e2eec145eedf5c717318f5 to your computer and use it in GitHub Desktop.

Download ZIP

Raw

OpenAI Token Counting.ipynb

Sorry, something went wrong. Reload?

Sorry, we cannot display this file.

Sorry, this file is invalid so it cannot be displayed.

Author

CGamesPlay commented Jan 3, 2024

I applied your ensure_ascii=False change, but I disagree about your FUNCTION_OVERHEAD change. Instead, OpenAI appears to now be formatting integers as "1" instead of "1.0".

    def format_default(schema):
        v = schema["default"]
        if schema["type"] == "number":
            return f"{v:.0f}" if float(v).is_integer() else str(v)
        else:
            return str(v)

The gist is updated with these changes (tests still pass).

eshamanideep commented Feb 1, 2024 •

edited

Loading

Amazing work! The latest gpt-3.5-turbo and gpt-4-turbo add support for parallel tool calls by injecting an extra tool. Here is the namespace and description I obtained from the OpenAI's API:
(continuation from the normal functions namespace)

// namespace functions

## multi_tool_use

// This tool serves as a wrapper for utilizing multiple tools. Each tool that can be used must be specified in the tool sections. Only tools in the functions namespace are permitted.
// Ensure that the parameters provided to each tool are valid according to that tool's specification.
namespace multi_tool_use {

// Use this function to run multiple tools simultaneously, but only if they can operate in parallel. Do this even if the prompt suggests using the tools sequentially.
type parallel = (_: {
// The tools to be executed in parallel. NOTE: only functions tools are permitted
tool_uses: {
// The name of the tool to use. The format should either be just the name of the tool, or in the format namespace.function_name for plugin and function tools.
recipient_name: string,
// The parameters to pass to the tool. Ensure these are valid according to the tool's own specifications.
parameters: object,
}[],
}) => any;

Author

CGamesPlay commented Jul 10, 2024 •

edited

Loading

I've updated the notebook to use the new tool calling interface and support the parallel tool calling option. Notable changes:

Validations are now exposed to the model (minimum, maximum, pattern).
Enums are no longer exposed to the model (note: it's still possible that OpenAI supports them through controlled generation, but untested)
Type titles are now exposed to the model. If you are autogenerating the schema title from the field name, this is wasting tokens.

One interesting note is that the overhead of the parallel tool calls doesn't seem to be reflected in the prompt usage value.

msp26 commented Aug 6, 2024

https://openai.com/index/introducing-structured-outputs-in-the-api/

Any changes from this?

junluowitrocking commented Oct 25, 2024

I've updated the notebook to use the new tool calling interface and support the parallel tool calling option. Notable changes:

Validations are now exposed to the model (minimum, maximum, pattern).

Enums are no longer exposed to the model (note: it's still possible that OpenAI supports them through controlled generation, but untested)

Type titles are now exposed to the model. If you are autogenerating the schema title from the field name, this is wasting tokens.

One interesting note is that the overhead of the parallel tool calls doesn't seem to be reflected in the prompt usage value.

Thanks for this wonderful work and details regarding the tool calling interface.

May you share some insights why OpenAI does not expose Enum to model? is it because the performance is not desirable when providing Enum items?
I noticed that "features" and "content" in send_message_tool are not exposed to the model even they are required. Any insights regarding that?
Thank you!

Author

CGamesPlay commented Oct 27, 2024

I don't work at OpenAI; I just documented observed behavior.

You're right about "features" and "content". I guess because they are "object" types, and they only allow a single flat object to be provided. Note: behavior may have changed since the notebook was last updated, I would advise verifying before implementing in your stack.

CGamesPlay/OpenAI Token Counting.ipynb

Select an option

No results found

Select an option