Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

reduce allocations for string(::IEEEFloat) #57977

Open
wants to merge 3 commits into
base: master
Choose a base branch
from

Conversation

oscardssmith
Copy link
Member

@oscardssmith oscardssmith commented Apr 1, 2025

julia> @time string(1.0)
  0.000004 seconds (3 allocations: 432 bytes) # before
  0.000007 seconds (3 allocations: 112 bytes) # after
julia> io = IOBuffer();
julia> @time show(io, 1.0)
  0.000014 seconds (3 allocations: 432 bytes) # before
  0.000012 seconds (1 allocation: 48 bytes) # after

mitigates #57976

@JeffBezanson JeffBezanson added the performance Must go faster label Apr 1, 2025
@bvdmitri
Copy link
Contributor

bvdmitri commented Apr 2, 2025

Can this PR also fix Printf.format in a similar manner? Currently it does allocate an intermediate array too and works almost in exactly same way as the show method. With the Printf.format though by using some internal API it is possible to do non-allocating printing of a number

julia> buf = Vector{UInt8}(undef, 10)
10-element Vector{UInt8}:
 0x69
 0x09
 0x00
 0x00
 0x00
 0x00
 0x00
 0x00
 0x6a
 0x09

julia> format = Printf.format"%.4f"
Printf.Format{Base.CodeUnits{UInt8, String}, Tuple{Printf.Spec{Val{'f'}}}}(UInt8[0x25, 0x2e, 0x34, 0x66], UnitRange{Int64}[1:0, 5:4], (%.4f,), 1)

julia> @time Printf.format(buf, 1, format, 1.0)
  0.000016 seconds
7

However, the signature of format is constrained to Vector and cannot use the Memory

@oscardssmith
Copy link
Member Author

I think printf should be left to a separate PR. The last thing to do here, is I would appreciate someone double checking that the numbers I've put down for neededfloatdigits are correct (and ideally find the formula that generates them). I believe these are the right numbers, but my search for them wasn't exactly principled)

@@ -111,16 +120,16 @@ end

function Base.show(io::IO, x::T, forceuntyped::Bool=false, fromprint::Bool=false) where {T <: Base.IEEEFloat}
compact = get(io, :compact, false)::Bool
buf = Base.StringVector(neededdigits(T))
buf = Memory{UInt8}(undef, neededfloatdigits(T))
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Extra bytes are needed (for "Float16()") if "typed && T isa Float16"

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I included this in neededfloatdigits(::Float16), but I guess it would make sense to separate it so we can allocate less when !typed


Number of digits necessary to represent type `T` in shortest precision.
"""
neededfloatdigits(::Type{Float64}) = 23
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

"If an IEEE 754 double-precision number is converted to a decimal string with at least 17 significant digits, and then converted back to double-precision representation, the final result must match the original number"

17 + decimal point + sign bit + "e" + second sign bit + max exponent (3) = 24?

https://en.wikipedia.org/wiki/Double-precision_floating-point_format

Number of digits necessary to represent type `T` in shortest precision.
"""
neededfloatdigits(::Type{Float64}) = 23
neededfloatdigits(::Type{Float32}) = 15
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

9 + decimal point + sign bit + "f" + second sign bit + max exponent (2) = 15

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

julia> extrema(i->length(string(reinterpret(Float32, i))), typemin(UInt32):typemax(UInt32))
(3, 15)

"""
neededfloatdigits(::Type{Float64}) = 23
neededfloatdigits(::Type{Float32}) = 15
neededfloatdigits(::Type{Float16}) = 18
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

5 + decimal point + sign bit + "e" + second sign bit + max exponent (2) = 11

julia> extrema(i->length(string(reinterpret(Float16, i))), typemin(UInt16):typemax(UInt16))
(3, 11)

(so maximum = 20 with typed=true)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
performance Must go faster
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants