[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: Quick note on Disk 18a
Michael and all,
Well, it is a question of what you mean by a "large peak" and a "small" peak.
The specks of the ADC call for "no missing codes" to 15 bits. They are
actually better than that.
What you are describing is called "differential linearity".
You did not say where the spike is. It is common for there to be a problem
at the large bit change boundaries. For example in going from 111111 (63)
to 1000000 (64). The problem can be in either direction.
Suppose that we look at "white" noise. That is all values in the range are
equally likely. If we take 1000 counts in each bin, then we expect to see
1000 counts in 63 (+/- 32) and 1000 counts in 64. Suppose we see 2000 in
63 and 0 in 64. This looks awful. It also meets the specification. In
fact it means that we are in error by one count in the measurement for half
of the data at this point. But what does it mean to our data?
It means that some of our stars may be measured in error by one count. OK,
in a systematic way. So we have to consider if that hurts us. But note
that our sky noise is typically 50 ADU, so we are looking for effects much
less than the noise.
Suppose we measured a zillion stars. Then made a plot of magnitude vs
number of stars at that magnitude. This would have a shape due to the
distribution of stars in the sky. It would (with enough data) have a fine
structure that was due to the ADC. I think such fine structure will be
smaller than the error. But this needs to be given proper analysis.
Suppose we consider a bunch of buckets spread out over a line catching
random rain drops. Differential linearity is a measure of the diameter of
each bucket. In our case, we are allowed a bucket of zero width. Integral
linearity is a measure of the uniform spacing of the buckets. In our case,
this is specified to be +/- four ADU over the full range. Thus bucket
23,567 should be within 16 positions of its specified position. This means
that all the buckets in between 1 and 23,567 have to be stretched or
squished so that no two adjacent buckets are of zero width or more that
twice their average diameter. One can have regions of wide and narrow
buckets as long as no bucket is off by more than 16 positions. But note
that this is better than 0.03%. We thus measure any block of photons to
at least one part in 4,000. One tends to be able to calibrate out the
Integral Linearity, so it is really better than this.
> Perhaps some sticky bits in the ADC?
Yep, all ADCs have sticky bits. Well, not all, there is the Wilkensen
type, but it is slow. The system noise actually helps a lot to mask this
generic problem. And as I say, this one seems to meet specs, and I think
is good enough for our system.
> It helps to combine the data into bins at least 8 values wide,
>but that leaves the every-64'th-pixel peaks. I can't make the bins
>64 values wide without losing resolution -- I think.
I think this is the wrong thing to do. You are introducing a ruler with
graduations of 64 because the error in position of the lines is
one. (Well, actually a few)
If this bothers you, (I think it should not) then try using non binary
binning. Using binary bins does little good for a binary problem. Say bin
by 3s or 5s. Better a weighted average over 5 bins.
I would be happy to discuss this further for anyone that is bothered by the
errors in the system. As a system designer, I have some options that can
trade off problems. Perhaps we can make the system better. There is also
a slightly better grade ADC. I bit better in both Integral Linearity and
Differential Linearity. I think we could not see the difference. But we
could discuss this.
I am delighted to hear of the progress. I can't wait to get hold of some
star lists. At least I think I can't. In actual fact it will be a couple
of years before I will be pondering data. Just too many things to fix.
Tom Droege
At 12:59 PM 5/20/01 -0400, you wrote:
> I've been working on a new distribution kit for my Mark IV
>reduction pipeline. I grabbed the first few images off Data Disk 18a
>to test the code, and noticed something odd.
>
> If one makes a histogram of the pixel values in the raw images
>on Disk 18a, one finds
>
> - the distribution of pixel values in I-band images shows
> a small peak every 8'th value, and a very large
> peak every 64'th value
>
> - the distribution of pixel values in V-band images shows
> a small peak every 2'nd or 4'th value; the amplitude of
> the variations is smaller than those in the I-band images
>
> Perhaps some sticky bits in the ADC?
>
> I have to be careful here -- the pipeline uses an XVista program
>called "sky" to calculate the sky value in each image. That program
>computes a single "sky" value for the frame by fitting a gaussian
>to the distribution of pixel values. The results of this fit may
>not be well defined when there are big jumps every N'th pixel.
>
> It helps to combine the data into bins at least 8 values wide,
>but that leaves the every-64'th-pixel peaks. I can't make the bins
>64 values wide without losing resolution -- I think.
>
> Anyway, just a warning to other data munchers.
>
> On the bright side, when I ran the pipeline on a set of 8 images
>in the middle of disk 18a -- which did show evidence for clouds --
>it successfully matched up every one with stars from the Tycho catalog
>and found a decent astrometric solution. Rah!
>
> I discovered that I need at least 45 or so reference stars per
>field in order to achieve success (with a 3rd-order polynomial distortion
>model). I wasn't getting that many Tycho stars when I cut the catalog
>at sigma(Vt) <= 0.05 mag, so I relaxed the requirement to
>
> sigma(Vt) <= 0.07 mag and sigma(Bt) <= 0.07 mag
>
>This gives me enough reference stars in every field I've checked
>so far.
>
> Michael Richmond