- 
                Notifications
    You must be signed in to change notification settings 
- Fork 128
RFC: Check element types #257
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
| let type_object = T::type_object(py); | ||
| let iterator = NpySingleIterBuilder::readwrite(array).build()?; | ||
|  | ||
| for element in iterator { | ||
| if !type_object.is_instance(element)? { | ||
| return Err(PyDowncastError::new(array, type_name::<T>()).into()); | ||
| } | ||
| } | 
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This could be probably written a bit shorter like
if NpySingleIterBuiler::readwrite(array).build()?.any(|el| !type_obj.is_instance(el)) {
    return Err(...);
}There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Possibly, but I am not sure we can use the iterators at all as they produce references to T, but we are still checking whether we actually have T or something else entirely.
| { | ||
| const IS_COPY: bool = false; | ||
|  | ||
| fn check_element_types<D: Dimension>(py: Python, array: &PyArray<Self, D>) -> PyResult<()> { | 
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
What I personally don't like is that there's again logic being added into Element, so it goes from being a descriptive trait to an 'actionable' trait once again.
Perhaps something like this might be cleaner? (and then the element-type checking logic can be moved out of here somewhere into array)
pub unsafe trait Element {
    const IS_COPY: bool;
    fn py_type(py: Python) -> Option<&PyType>; // None for anything with IS_COPY=true or PyAny
    fn get_dtype(py: Python) -> &PyArrayDescr;
}There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
What I personally don't like is that there's again logic being added into Element, so it goes from being a descriptive trait to an 'actionable' trait once again.
The point of this was to make the policy changeable by user code, i.e. I really known I'll get PyArray<T, D> and do not want to pay the cost of checking the types, I can implement this unsoundly myself. But I guess due to the general issues, using PyArray<PyObject, D> together with unchecked_downcast would be the way to make this work. We just need to document this...
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I guess my point was, it doesn't feel like it should be stuffed in here. Do you have an example in mind where you would implement something custom for your type that doesn't just go through the elements and checks instance types? Like, what else can you reasonably do?
Another disadantage here being, if I do want to do something custom downstream but similar to what's being done here, I would basically have to literally copy-paste rust-numpy source code that iterates over numpy array items - that's not very nice.
And finally, the "don't check this" part doesn't sound like it belongs to the type itself, but rather to the call point. If you bind it to the type, you can't do "here I know it's safe, don't check, but here it's arbitrary user input, need to check" which would be a fairly reasonable use case, I believe. This you could probably achieve by having a separate trait on PyArray<T, D> (pulling this stuff out of T: Element) with default impl that always checks instance types for any Py<T> that's not PyAny, and then add a newtype wrapper like Unchecked<PyArray<T, D>> that would override this and impl a FromPyObject for it. Just a thought. (and, Unchecked is probably too generic a name here)
(Note that you could probably also achieve the above by having an Unchecked<T> but this would be much worse from the ergonomics standpoint as now you have an array of ugly wrappers instead of a single wrapper of an array.)
| 
 There's a few distinct cases: 
 x = np.arange(3)
assert x.flags.owndata
x.setflags(write=False)
y = x.view()
y.setflags(write=True)  # ValueError: cannot set WRITEABLE flag to True of this array | 
| 
 I would like to not prevent this, but I have to produce a sound interface handling both cases. Having separate  | 
| I have added stronger language to #255 based on the aliasing issue raised here. | 
…ng Py<T> with custom types T.
febacd6    to
    f31030e      
    Compare
  
    | While considering whether to implement 
 to enable 
 I noticed that even that isn't that simple: As soon as you one does something like let array: PyArray<Py<T>, D> = ...;
callable.call1(py, (array,));then  So I guess the only thing we should provide is a zero-copy function to go from  | 
This a single WIP commit on top of #256. I wonder if it would enable us to salvage custom element types but I have many questions:
Py<T>withT: PyTypeInforeally a sufficient bound to allow as elements of object arrays? @davidhewittPyArray<Py<T>, D>to the check method when we are still figuring out whether the elements are indeedPy<T>?&mut Py<T>which is almost surely UB if the element is not in factPy<T>? So this would be mean using the FFI directly?But finally, the array might be aliasing by Python code. So who is to say that after checking the element types, the Python code won't go ahead and replace individual elements by differently typed objects? This would suggest that we can never support object arrays containing anything but
PyObject?