This is the mail archive of the gcc-patches@gcc.gnu.org mailing list for the GCC project.


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]
Other format: [Raw text]

Re: [PATCH] add overlap function to gcov-tool


> On Tue, Oct 7, 2014 at 9:31 PM, Jan Hubicka <hubicka@ucw.cz> wrote:
> >> Hi,
> >>
> >> This patch adds overlap functionality to gcov-tool. The overlap score
> >> estimates the similarity of two profiles. Currently it only computes
> >> overlap for arc counters.
> >>
> >> The overlap score is defined as
> >> \sum minimum (p1-counter[i] / p1-sum-all, p2-counter[i] / p2-sum-all)
> >> where p1-counter[i] and p2-counter[2] are two matched counter from
> >> profile1 and profiler2.
> >> p1-sum-all and p2-sum-all are the sum-all counters in profiler1 and
> >> profile2, repetitively.
> >
> > The patch looks fine in general.  My statistics is all rusty, but can't we use
> > one of the established techniques like Kullback-Leibler to compare the
> > probabilitis distributions?
> 
> Interesting. I never thought of using Kullback-Leibler divergence.
> It's very easy to switch to KL using this overlap framework -- only a
> few lines of change.
> The problem is KL divergence assumes absolute continuity (i.e. q(i)
> ==0 --> p(i) =0, which is
> not true in our distribution, I'm not sure how to work around this.).
> 
> I did try earth-mover-distance (EMD) in our earlier internal version.
> But since I used
> uniform distance, the problem can be simplified to distribution diffs.

I see, well, i guess your metric is easy to underatnd and fits the bill.
> 
> > It would be also nice to have ability to compare
> > branch probabilities in btween train runs.
> 
> Do you mean to do the comparison in CFG rather on the raw counters?
> We need gcno file to reconstruct the CFG. That needs some work.

Yes, I think in longer term we want to have gcov functionality as a
library with resonable API, so it can be used by gcov/gcov-tool
and to implement instrumentation at linktime with LTO (here we will
need to decide on instrumentaiton counter placement without actually
having a function body streamed in, so I think we can have graph abstraction
class and implement the CFG/instrumentation placement/solving around it.
The graph may be either real CFG in compiler or one read from GCDA files.

The patch is OK.
Honza
> 
> -Rong
> 
> >
> > Honza
> >>
> >> The resulting score is a value ranging from 0.0 to 1.0 where 0.0 means
> >> no match and 1.0 mean a perfect match.
> >>
> >> This tool can be used in performance triaging and reducing the fdo
> >> training set size (where similar inputs can be pruned).
> >>
> >> Tested with spec2006 profiles.
> >>
> >> Thanks,
> >>
> >> -Rong
> >
> >> 2014-10-07  Rong Xu  <xur@google.com>
> >>
> >>       * gcc/gcov-tool.c (profile_overlap): New driver function
> >>         to compute profile overlap.
> >>       (print_overlap_usage_message): New.
> >>       (overlap_usage): New.
> >>       (do_overlap): New.
> >>       (print_usage): Add calls to overlap function.
> >>       (main): Ditto.
> >>       * libgcc/libgcov-util.c (read_gcda_file): Fix format.
> >>       (find_match_gcov_info): Ditto.
> >>       (calculate_2_entries): New.
> >>       (compute_one_gcov): Ditto.
> >>       (gcov_info_count_all_cold): Ditto.
> >>       (gcov_info_count_all_zero): Ditto.
> >>       (extract_file_basename): Ditto.
> >>       (get_file_basename): Ditto.
> >>       (set_flag): Ditto.
> >>       (matched_gcov_info): Ditto.
> >>       (calculate_overlap): Ditto.
> >>       (gcov_profile_overlap): Ditto.
> >>       * libgcc/libgcov-driver.c (compute_summary): Make
> >>         it avavilable for external calls.
> >>       * gcc/doc/gcov-tool.texi: Add documentation.
> >>
> >> Index: gcc/gcov-tool.c
> >> ===================================================================
> >> --- gcc/gcov-tool.c   (revision 215981)
> >> +++ gcc/gcov-tool.c   (working copy)
> >> @@ -39,6 +39,7 @@ see the files COPYING3 and COPYING.RUNTIME respect
> >>  #include <getopt.h>
> >>
> >>  extern int gcov_profile_merge (struct gcov_info*, struct gcov_info*, int, int);
> >> +extern int gcov_profile_overlap (struct gcov_info*, struct gcov_info*);
> >>  extern int gcov_profile_normalize (struct gcov_info*, gcov_type);
> >>  extern int gcov_profile_scale (struct gcov_info*, float, int, int);
> >>  extern struct gcov_info* gcov_read_profile_dir (const char*, int);
> >> @@ -368,6 +369,121 @@ do_rewrite (int argc, char **argv)
> >>    return ret;
> >>  }
> >>
> >> +/* Driver function to computer the overlap score b/w profile D1 and D2.
> >> +   Return 1 on error and 0 if OK.  */
> >> +
> >> +static int
> >> +profile_overlap (const char *d1, const char *d2)
> >> +{
> >> +  struct gcov_info *d1_profile;
> >> +  struct gcov_info *d2_profile;
> >> +
> >> +  d1_profile = gcov_read_profile_dir (d1, 0);
> >> +  if (!d1_profile)
> >> +    return 1;
> >> +
> >> +  if (d2)
> >> +    {
> >> +      d2_profile = gcov_read_profile_dir (d2, 0);
> >> +      if (!d2_profile)
> >> +        return 1;
> >> +
> >> +      return gcov_profile_overlap (d1_profile, d2_profile);
> >> +    }
> >> +
> >> +  return 1;
> >> +}
> >> +
> >> +/* Usage message for profile overlap.  */
> >> +
> >> +static void
> >> +print_overlap_usage_message (int error_p)
> >> +{
> >> +  FILE *file = error_p ? stderr : stdout;
> >> +
> >> +  fnotice (file, "  overlap [options] <dir1> <dir2>       Compute the overlap of two profiles\n");
> >> +  fnotice (file, "    -v, --verbose                       Verbose mode\n");
> >> +  fnotice (file, "    -h, --hotonly                       Only print info for hot objects/functions\n");
> >> +  fnotice (file, "    -f, --function                      Print function level info\n");
> >> +  fnotice (file, "    -F, --fullname                      Print full filename\n");
> >> +  fnotice (file, "    -o, --object                        Print object level info\n");
> >> +  fnotice (file, "    -t <float>, --hot_threshold <float> Set the threshold for hotness\n");
> >> +
> >> +}
> >> +
> >> +static const struct option overlap_options[] =
> >> +{
> >> +  { "verbose",                no_argument,       NULL, 'v' },
> >> +  { "function",               no_argument,       NULL, 'f' },
> >> +  { "fullname",               no_argument,       NULL, 'F' },
> >> +  { "object",                 no_argument,       NULL, 'o' },
> >> +  { "hotonly",                no_argument,       NULL, 'h' },
> >> +  { "hot_threshold",          required_argument, NULL, 't' },
> >> +  { 0, 0, 0, 0 }
> >> +};
> >> +
> >> +/* Print overlap usage and exit.  */
> >> +
> >> +static void
> >> +overlap_usage (void)
> >> +{
> >> +  fnotice (stderr, "Overlap subcomand usage:");
> >> +  print_overlap_usage_message (true);
> >> +  exit (FATAL_EXIT_CODE);
> >> +}
> >> +
> >> +int overlap_func_level;
> >> +int overlap_obj_level;
> >> +int overlap_hot_only;
> >> +int overlap_use_fullname;
> >> +double overlap_hot_threshold = 0.005;
> >> +
> >> +/* Driver for profile overlap sub-command.  */
> >> +
> >> +static int
> >> +do_overlap (int argc, char **argv)
> >> +{
> >> +  int opt;
> >> +  int ret;
> >> +
> >> +  optind = 0;
> >> +  while ((opt = getopt_long (argc, argv, "vfFoht:", overlap_options, NULL)) != -1)
> >> +    {
> >> +      switch (opt)
> >> +        {
> >> +        case 'v':
> >> +          verbose = true;
> >> +          gcov_set_verbose ();
> >> +          break;
> >> +        case 'f':
> >> +          overlap_func_level = 1;
> >> +          break;
> >> +        case 'F':
> >> +          overlap_use_fullname = 1;
> >> +          break;
> >> +        case 'o':
> >> +          overlap_obj_level = 1;
> >> +          break;
> >> +        case 'h':
> >> +          overlap_hot_only = 1;
> >> +          break;
> >> +        case 't':
> >> +          overlap_hot_threshold = atof (optarg);
> >> +          break;
> >> +        default:
> >> +          overlap_usage ();
> >> +        }
> >> +    }
> >> +
> >> +  if (argc - optind == 2)
> >> +    ret = profile_overlap (argv[optind], argv[optind+1]);
> >> +  else
> >> +    overlap_usage ();
> >> +
> >> +  return ret;
> >> +}
> >> +
> >> +
> >>  /* Print a usage message and exit.  If ERROR_P is nonzero, this is an error,
> >>     otherwise the output of --help.  */
> >>
> >> @@ -383,6 +499,7 @@ print_usage (int error_p)
> >>    fnotice (file, "  -v, --version                         Print version number, then exit\n");
> >>    print_merge_usage_message (error_p);
> >>    print_rewrite_usage_message (error_p);
> >> +  print_overlap_usage_message (error_p);
> >>    fnotice (file, "\nFor bug reporting instructions, please see:\n%s.\n",
> >>             bug_report_url);
> >>    exit (status);
> >> @@ -471,6 +588,8 @@ main (int argc, char **argv)
> >>      return do_merge (argc - optind, argv + optind);
> >>    else if (!strcmp (sub_command, "rewrite"))
> >>      return do_rewrite (argc - optind, argv + optind);
> >> +  else if (!strcmp (sub_command, "overlap"))
> >> +    return do_overlap (argc - optind, argv + optind);
> >>
> >>    print_usage (true);
> >>  }
> >> Index: libgcc/libgcov-util.c
> >> ===================================================================
> >> --- libgcc/libgcov-util.c     (revision 215981)
> >> +++ libgcc/libgcov-util.c     (working copy)
> >> @@ -319,59 +319,59 @@ read_gcda_file (const char *filename)
> >>
> >>        tag = gcov_read_unsigned ();
> >>        if (!tag)
> >> -     break;
> >> +        break;
> >>        length = gcov_read_unsigned ();
> >>        base = gcov_position ();
> >>        mask = GCOV_TAG_MASK (tag) >> 1;
> >>        for (tag_depth = 4; mask; mask >>= 8)
> >> -     {
> >> -       if (((mask & 0xff) != 0xff))
> >> -         {
> >> -           warning (0, "%s:tag `%x' is invalid\n", filename, tag);
> >> -           break;
> >> -         }
> >> -       tag_depth--;
> >> -     }
> >> +        {
> >> +          if (((mask & 0xff) != 0xff))
> >> +            {
> >> +              warning (0, "%s:tag `%x' is invalid\n", filename, tag);
> >> +              break;
> >> +            }
> >> +          tag_depth--;
> >> +        }
> >>        for (format = tag_table; format->name; format++)
> >> -     if (format->tag == tag)
> >> -       goto found;
> >> +        if (format->tag == tag)
> >> +          goto found;
> >>        format = &tag_table[GCOV_TAG_IS_COUNTER (tag) ? 2 : 1];
> >>      found:;
> >>        if (tag)
> >> -     {
> >> -       if (depth && depth < tag_depth)
> >> -         {
> >> -           if (!GCOV_TAG_IS_SUBTAG (tags[depth - 1], tag))
> >> -             warning (0, "%s:tag `%x' is incorrectly nested\n",
> >> -                     filename, tag);
> >> -         }
> >> -       depth = tag_depth;
> >> -       tags[depth - 1] = tag;
> >> -     }
> >> +        {
> >> +          if (depth && depth < tag_depth)
> >> +            {
> >> +              if (!GCOV_TAG_IS_SUBTAG (tags[depth - 1], tag))
> >> +                warning (0, "%s:tag `%x' is incorrectly nested\n",
> >> +                         filename, tag);
> >> +            }
> >> +          depth = tag_depth;
> >> +          tags[depth - 1] = tag;
> >> +        }
> >>
> >>        if (format->proc)
> >>          {
> >> -       unsigned long actual_length;
> >> +          unsigned long actual_length;
> >>
> >> -       (*format->proc) (tag, length);
> >> +          (*format->proc) (tag, length);
> >>
> >> -       actual_length = gcov_position () - base;
> >> -       if (actual_length > length)
> >> -         warning (0, "%s:record size mismatch %lu bytes overread\n",
> >> -                 filename, actual_length - length);
> >> -       else if (length > actual_length)
> >> -         warning (0, "%s:record size mismatch %lu bytes unread\n",
> >> -                 filename, length - actual_length);
> >> -     }
> >> +          actual_length = gcov_position () - base;
> >> +          if (actual_length > length)
> >> +            warning (0, "%s:record size mismatch %lu bytes overread\n",
> >> +                     filename, actual_length - length);
> >> +          else if (length > actual_length)
> >> +            warning (0, "%s:record size mismatch %lu bytes unread\n",
> >> +                     filename, length - actual_length);
> >> +       }
> >>
> >>        gcov_sync (base, length);
> >>        if ((error = gcov_is_error ()))
> >> -     {
> >> -       warning (0, error < 0 ? "%s:counter overflow at %lu\n" :
> >> -                               "%s:read error at %lu\n", filename,
> >> -               (long unsigned) gcov_position ());
> >> -       break;
> >> -     }
> >> +        {
> >> +          warning (0, error < 0 ? "%s:counter overflow at %lu\n" :
> >> +                                  "%s:read error at %lu\n", filename,
> >> +                   (long unsigned) gcov_position ());
> >> +          break;
> >> +        }
> >>      }
> >>
> >>    read_gcda_finalize (obj_info);
> >> @@ -577,7 +577,8 @@ gcov_merge (struct gcov_info *info1, struct gcov_i
> >>     Return NULL if there is no match.  */
> >>
> >>  static struct gcov_info *
> >> -find_match_gcov_info (struct gcov_info **array, int size, struct gcov_info *info)
> >> +find_match_gcov_info (struct gcov_info **array, int size,
> >> +                   struct gcov_info *info)
> >>  {
> >>    struct gcov_info *gi_ptr;
> >>    struct gcov_info *ret = NULL;
> >> @@ -872,7 +873,530 @@ gcov_profile_normalize (struct gcov_info *profile,
> >>
> >>    scale_factor = (float)max_val / curr_max_val;
> >>    if (verbose)
> >> -    fnotice (stdout, "max_val is %lld\n", (long long) curr_max_val);
> >> +    fnotice (stdout, "max_val is %"PRId64"\n", curr_max_val);
> >>
> >>    return gcov_profile_scale (profile, scale_factor, 0, 0);
> >>  }
> >> +
> >> +/* The following variables are defined in gcc/gcov-tool.c.  */
> >> +extern int overlap_func_level;
> >> +extern int overlap_obj_level;
> >> +extern int overlap_hot_only;
> >> +extern int overlap_use_fullname;
> >> +extern double overlap_hot_threshold;
> >> +
> >> +/* Compute the overlap score of two values. The score is defined as:
> >> +    min (V1/SUM_1, V2/SUM_2)  */
> >> +
> >> +static double
> >> +calculate_2_entries (const unsigned long v1, const unsigned long v2,
> >> +                     const double sum_1, const double sum_2)
> >> +{
> >> +  double val1 = (sum_1 == 0.0 ? 0.0 : v1/sum_1);
> >> +  double val2 = (sum_2 == 0.0 ? 0.0 : v2/sum_2);
> >> +
> >> +  if (val2 < val1)
> >> +    val1 = val2;
> >> +
> >> +  return val1;
> >> +}
> >> +
> >> +/*  Compute the overlap score between GCOV_INFO1 and GCOV_INFO2.
> >> +    SUM_1 is the sum_all for profile1 where GCOV_INFO1 belongs.
> >> +    SUM_2 is the sum_all for profile2 where GCOV_INFO2 belongs.
> >> +    This function also updates cumulative score CUM_1_RESULT and
> >> +    CUM_2_RESULT.  */
> >> +
> >> +static double
> >> +compute_one_gcov (const struct gcov_info *gcov_info1,
> >> +                  const struct gcov_info *gcov_info2,
> >> +                  const double sum_1, const double sum_2,
> >> +                  double *cum_1_result, double *cum_2_result)
> >> +{
> >> +  unsigned f_ix;
> >> +  double ret = 0;
> >> +  double cum_1 = 0, cum_2 = 0;
> >> +  const struct gcov_info *gcov_info = 0;
> >> +  double *cum_p;
> >> +  double sum;
> >> +
> >> +  gcc_assert (gcov_info1 || gcov_info2);
> >> +  if (!gcov_info1)
> >> +    {
> >> +      gcov_info = gcov_info2;
> >> +      cum_p = cum_2_result;
> >> +      sum = sum_2;
> >> +      *cum_1_result = 0;
> >> +    } else
> >> +  if (!gcov_info2)
> >> +    {
> >> +      gcov_info = gcov_info1;
> >> +      cum_p = cum_1_result;
> >> +      sum = sum_1;
> >> +      *cum_2_result = 0;
> >> +    }
> >> +
> >> +  if (gcov_info)
> >> +  {
> >> +    for (f_ix = 0; f_ix < gcov_info->n_functions; f_ix++)
> >> +      {
> >> +        unsigned t_ix;
> >> +        const struct gcov_fn_info *gfi_ptr = gcov_info->functions[f_ix];
> >> +        if (!gfi_ptr || gfi_ptr->key != gcov_info)
> >> +          continue;
> >> +        const struct gcov_ctr_info *ci_ptr = gfi_ptr->ctrs;
> >> +        for (t_ix = 0; t_ix < GCOV_COUNTERS_SUMMABLE; t_ix++)
> >> +          {
> >> +            unsigned c_num;
> >> +
> >> +            if (!gcov_info->merge[t_ix])
> >> +              continue;
> >> +
> >> +            for (c_num = 0; c_num < ci_ptr->num; c_num++)
> >> +              {
> >> +                cum_1 += ci_ptr->values[c_num] / sum;
> >> +              }
> >> +            ci_ptr++;
> >> +          }
> >> +      }
> >> +    *cum_p = cum_1;
> >> +    return 0.0;
> >> +  }
> >> +
> >> +  for (f_ix = 0; f_ix < gcov_info1->n_functions; f_ix++)
> >> +    {
> >> +      unsigned t_ix;
> >> +      double func_cum_1 = 0.0;
> >> +      double func_cum_2 = 0.0;
> >> +      double func_val = 0.0;
> >> +      int nonzero = 0;
> >> +      int hot = 0;
> >> +      const struct gcov_fn_info *gfi_ptr1 = gcov_info1->functions[f_ix];
> >> +      const struct gcov_fn_info *gfi_ptr2 = gcov_info2->functions[f_ix];
> >> +
> >> +      if (!gfi_ptr1 || gfi_ptr1->key != gcov_info1)
> >> +        continue;
> >> +      if (!gfi_ptr2 || gfi_ptr2->key != gcov_info2)
> >> +        continue;
> >> +
> >> +      const struct gcov_ctr_info *ci_ptr1 = gfi_ptr1->ctrs;
> >> +      const struct gcov_ctr_info *ci_ptr2 = gfi_ptr2->ctrs;
> >> +      for (t_ix = 0; t_ix < GCOV_COUNTERS_SUMMABLE; t_ix++)
> >> +        {
> >> +          unsigned c_num;
> >> +
> >> +          if (!gcov_info1->merge[t_ix])
> >> +            continue;
> >> +
> >> +          for (c_num = 0; c_num < ci_ptr1->num; c_num++)
> >> +            {
> >> +              if (ci_ptr1->values[c_num] | ci_ptr2->values[c_num])
> >> +                {
> >> +                  func_val += calculate_2_entries (ci_ptr1->values[c_num],
> >> +                                          ci_ptr2->values[c_num],
> >> +                                          sum_1, sum_2);
> >> +
> >> +                  func_cum_1 += ci_ptr1->values[c_num] / sum_1;
> >> +                  func_cum_2 += ci_ptr2->values[c_num] / sum_2;
> >> +                  nonzero = 1;
> >> +                  if (ci_ptr1->values[c_num] / sum_1 >= overlap_hot_threshold ||
> >> +                      ci_ptr2->values[c_num] / sum_2 >= overlap_hot_threshold)
> >> +                    hot = 1;
> >> +                }
> >> +            }
> >> +          ci_ptr1++;
> >> +          ci_ptr2++;
> >> +        }
> >> +      ret += func_val;
> >> +      cum_1 += func_cum_1;
> >> +      cum_2 += func_cum_2;
> >> +      if (overlap_func_level && nonzero && (!overlap_hot_only || hot))
> >> +        {
> >> +          printf("   \tfunc_id=%10d \toverlap =%6.5f%% (%5.5f%% %5.5f%%)\n",
> >> +                 gfi_ptr1->ident, func_val*100, func_cum_1*100, func_cum_2*100);
> >> +        }
> >> +    }
> >> +  *cum_1_result = cum_1;
> >> +  *cum_2_result = cum_2;
> >> +  return ret;
> >> +}
> >> +
> >> +/* Test if all counter values in this GCOV_INFO are cold.
> >> +   "Cold" is defined as the counter value being less than
> >> +   or equal to THRESHOLD.  */
> >> +
> >> +static bool
> >> +gcov_info_count_all_cold (const struct gcov_info *gcov_info,
> >> +                          gcov_type threshold)
> >> +{
> >> +  unsigned f_ix;
> >> +
> >> +  for (f_ix = 0; f_ix < gcov_info->n_functions; f_ix++)
> >> +    {
> >> +      unsigned t_ix;
> >> +      const struct gcov_fn_info *gfi_ptr = gcov_info->functions[f_ix];
> >> +
> >> +      if (!gfi_ptr || gfi_ptr->key != gcov_info)
> >> +        continue;
> >> +      const struct gcov_ctr_info *ci_ptr = gfi_ptr->ctrs;
> >> +      for (t_ix = 0; t_ix < GCOV_COUNTERS_SUMMABLE; t_ix++)
> >> +        {
> >> +          unsigned c_num;
> >> +
> >> +          if (!gcov_info->merge[t_ix])
> >> +            continue;
> >> +
> >> +          for (c_num = 0; c_num < ci_ptr->num; c_num++)
> >> +            {
> >> +              if (ci_ptr->values[c_num] > threshold)
> >> +                return false;
> >> +            }
> >> +          ci_ptr++;
> >> +        }
> >> +    }
> >> +
> >> +  return true;
> >> +}
> >> +
> >> +/* Test if all counter values in this GCOV_INFO are 0.  */
> >> +
> >> +static bool
> >> +gcov_info_count_all_zero (const struct gcov_info *gcov_info)
> >> +{
> >> +  return gcov_info_count_all_cold (gcov_info, 0);
> >> +}
> >> +
> >> +/* A pair of matched GCOV_INFO.
> >> +   The flag is a bitvector:
> >> +     b0: obj1's all counts are 0;
> >> +     b1: obj1's all counts are cold (but no 0);
> >> +     b2: obj1 is hot;
> >> +     b3: no obj1 to match obj2;
> >> +     b4: obj2's all counts are 0;
> >> +     b5: obj2's all counts are cold (but no 0);
> >> +     b6: obj2 is hot;
> >> +     b7: no obj2 to match obj1;
> >> + */
> >> +struct overlap_t {
> >> +   const struct gcov_info *obj1;
> >> +   const struct gcov_info *obj2;
> >> +   char flag;
> >> +};
> >> +
> >> +#define FLAG_BOTH_ZERO(flag) ((flag & 0x1) && (flag & 0x10))
> >> +#define FLAG_BOTH_COLD(flag) ((flag & 0x2) && (flag & 0x20))
> >> +#define FLAG_ONE_HOT(flag) ((flag & 0x4) || (flag & 0x40))
> >> +
> >> +/* Cumlative overlap dscore for profile1 and profile2.  */
> >> +static double overlap_sum_1, overlap_sum_2;
> >> +
> >> +/* sum_all for profile1 and profile2.  */
> >> +static gcov_type p1_sum_all, p2_sum_all;
> >> +
> >> +/* run_max for profile1 and profile2.  */
> >> +static gcov_type p1_run_max, p2_run_max;
> >> +
> >> +/* The number of gcda files in the profiles.  */
> >> +static unsigned gcda_files[2];
> >> +
> >> +/* The number of unique gcda files in the profiles
> >> +   (not existing in the other profile).  */
> >> +static unsigned unique_gcda_files[2];
> >> +
> >> +/* The number of gcda files that all counter values are 0.  */
> >> +static unsigned zero_gcda_files[2];
> >> +
> >> +/* The number of gcda files that all counter values are cold (but not 0).  */
> >> +static unsigned cold_gcda_files[2];
> >> +
> >> +/* The number of gcda files that includes hot counter values.  */
> >> +static unsigned hot_gcda_files[2];
> >> +
> >> +/* The number of gcda files with hot count value in either profiles.  */
> >> +static unsigned both_hot_cnt;
> >> +
> >> +/* The number of gcda files with all counts cold (but not 0) in
> >> +   both profiles. */
> >> +static unsigned both_cold_cnt;
> >> +
> >> +/* The number of gcda files with all counts 0 in both profiles.  */
> >> +static unsigned both_zero_cnt;
> >> +
> >> +/* Extract the basename of the filename NAME.  */
> >> +
> >> +static char *
> >> +extract_file_basename (const char *name)
> >> +{
> >> +  char *str;
> >> +  int len = 0;
> >> +  char *path = xstrdup (name);
> >> +  char sep_str[2];
> >> +
> >> +  sep_str[0] = DIR_SEPARATOR;
> >> +  sep_str[1] = 0;
> >> +  str = strstr(path, sep_str);
> >> +  do{
> >> +      len = strlen(str) + 1;
> >> +      path = &path[strlen(path) - len + 2];
> >> +      str = strstr(path, sep_str);
> >> +  } while(str);
> >> +
> >> +  return path;
> >> +}
> >> +
> >> +/* Utility function to get the filename.  */
> >> +
> >> +static const char *
> >> +get_file_basename (const char *name)
> >> +{
> >> +  if (overlap_use_fullname)
> >> +    return name;
> >> +  return extract_file_basename (name);
> >> +}
> >> +
> >> +/* A utility function to set the flag for the gcda files.  */
> >> +
> >> +static void
> >> +set_flag (struct overlap_t *e)
> >> +{
> >> +  char flag = 0;
> >> +
> >> +  if (!e->obj1)
> >> +    {
> >> +      unique_gcda_files[1]++;
> >> +      flag = 0x8;
> >> +    }
> >> +  else
> >> +    {
> >> +      gcda_files[0]++;
> >> +      if (gcov_info_count_all_zero (e->obj1))
> >> +        {
> >> +          zero_gcda_files[0]++;
> >> +          flag = 0x1;
> >> +        }
> >> +      else
> >> +      if (gcov_info_count_all_cold (e->obj1, overlap_sum_1
> >> +                           * overlap_hot_threshold))
> >> +        {
> >> +          cold_gcda_files[0]++;
> >> +          flag = 0x2;
> >> +        }
> >> +      else
> >> +        {
> >> +          hot_gcda_files[0]++;
> >> +          flag = 0x4;
> >> +        }
> >> +    }
> >> +
> >> +  if (!e->obj2)
> >> +    {
> >> +      unique_gcda_files[0]++;
> >> +      flag |= (0x8 << 4);
> >> +    }
> >> +  else
> >> +    {
> >> +      gcda_files[1]++;
> >> +      if (gcov_info_count_all_zero (e->obj2))
> >> +        {
> >> +          zero_gcda_files[1]++;
> >> +          flag |= (0x1 << 4);
> >> +        }
> >> +      else
> >> +      if (gcov_info_count_all_cold (e->obj2, overlap_sum_2
> >> +                           * overlap_hot_threshold))
> >> +        {
> >> +          cold_gcda_files[1]++;
> >> +          flag |= (0x2 << 4);
> >> +        }
> >> +      else
> >> +        {
> >> +          hot_gcda_files[1]++;
> >> +          flag |= (0x4 << 4);
> >> +        }
> >> +    }
> >> +
> >> +  gcc_assert (flag);
> >> +  e->flag = flag;
> >> +}
> >> +
> >> +/* Test if INFO1 and INFO2 are from the matched source file.
> >> +   Return 1 if they match; return 0 otherwise.  */
> >> +
> >> +static int
> >> +matched_gcov_info (const struct gcov_info *info1, const struct gcov_info *info2)
> >> +{
> >> +  /* For FDO, we have to match the name. This can be expensive.
> >> +     Maybe we should use hash here.  */
> >> +  if (strcmp (info1->filename, info2->filename))
> >> +    return 0;
> >> +
> >> +  if (info1->n_functions != info2->n_functions)
> >> +    {
> >> +      fnotice (stderr, "mismatched profiles in %s (%d functions"
> >> +                       " vs %d functions)\n",
> >> +                       info1->filename,
> >> +                       info1->n_functions,
> >> +                       info2->n_functions);
> >> +      return 0;
> >> +    }
> >> +  return 1;
> >> +}
> >> +
> >> +/* Defined in libgcov-driver.c.  */
> >> +extern gcov_unsigned_t compute_summary (struct gcov_info *,
> >> +                 struct gcov_summary *, size_t *);
> >> +
> >> +/* Compute the overlap score of two profiles with the head of GCOV_LIST1 and
> >> +   GCOV_LIST1. Return a number ranging from [0.0, 1.0], with 0.0 meaning no
> >> +   match and 1.0 meaning a perfect match.  */
> >> +
> >> +static double
> >> +calculate_overlap (struct gcov_info *gcov_list1,
> >> +                   struct gcov_info *gcov_list2)
> >> +{
> >> +  struct gcov_summary this_prg;
> >> +  unsigned list1_cnt = 0, list2_cnt= 0, all_cnt;
> >> +  unsigned int i, j;
> >> +  size_t max_length;
> >> +  const struct gcov_info *gi_ptr;
> >> +  struct overlap_t *all_infos;
> >> +
> >> +  compute_summary (gcov_list1, &this_prg, &max_length);
> >> +  overlap_sum_1 = (double) (this_prg.ctrs[0].sum_all);
> >> +  p1_sum_all = this_prg.ctrs[0].sum_all;
> >> +  p1_run_max = this_prg.ctrs[0].run_max;
> >> +  compute_summary (gcov_list2, &this_prg, &max_length);
> >> +  overlap_sum_2 = (double) (this_prg.ctrs[0].sum_all);
> >> +  p2_sum_all = this_prg.ctrs[0].sum_all;
> >> +  p2_run_max = this_prg.ctrs[0].run_max;
> >> +
> >> +  for (gi_ptr = gcov_list1; gi_ptr; gi_ptr = gi_ptr->next)
> >> +    list1_cnt++;
> >> +  for (gi_ptr = gcov_list2; gi_ptr; gi_ptr = gi_ptr->next)
> >> +    list2_cnt++;
> >> +  all_cnt = list1_cnt + list2_cnt;
> >> +  all_infos = (struct overlap_t *) xmalloc (sizeof (struct overlap_t)
> >> +               * all_cnt * 2);
> >> +  gcc_assert (all_infos);
> >> +
> >> +  i = 0;
> >> +  for (gi_ptr = gcov_list1; gi_ptr; gi_ptr = gi_ptr->next, i++)
> >> +    {
> >> +      all_infos[i].obj1 = gi_ptr;
> >> +      all_infos[i].obj2 = 0;
> >> +    }
> >> +
> >> +  for (gi_ptr = gcov_list2; gi_ptr; gi_ptr = gi_ptr->next, i++)
> >> +    {
> >> +      all_infos[i].obj1 = 0;
> >> +      all_infos[i].obj2 = gi_ptr;
> >> +    }
> >> +
> >> +  for (i = list1_cnt; i < all_cnt; i++)
> >> +    {
> >> +      if (all_infos[i].obj2 == 0)
> >> +        continue;
> >> +      for (j = 0; j < list1_cnt; j++)
> >> +        {
> >> +          if (all_infos[j].obj2 != 0)
> >> +            continue;
> >> +          if (matched_gcov_info (all_infos[i].obj2, all_infos[j].obj1))
> >> +            {
> >> +              all_infos[j].obj2 = all_infos[i].obj2;
> >> +              all_infos[i].obj2 = 0;
> >> +              break;
> >> +            }
> >> +        }
> >> +    }
> >> +
> >> +  for (i = 0; i < all_cnt; i++)
> >> +    if (all_infos[i].obj1 || all_infos[i].obj2)
> >> +      {
> >> +        set_flag (all_infos + i);
> >> +        if (FLAG_ONE_HOT (all_infos[i].flag))
> >> +            both_hot_cnt++;
> >> +        if (FLAG_BOTH_COLD(all_infos[i].flag))
> >> +            both_cold_cnt++;
> >> +        if (FLAG_BOTH_ZERO(all_infos[i].flag))
> >> +            both_zero_cnt++;
> >> +      }
> >> +
> >> +  double prg_val = 0;
> >> +  double sum_val = 0;
> >> +  double sum_cum_1 = 0;
> >> +  double sum_cum_2 = 0;
> >> +
> >> +  for (i = 0; i < all_cnt; i++)
> >> +    {
> >> +      double val;
> >> +      double cum_1, cum_2;
> >> +      const char *filename;
> >> +
> >> +      if (all_infos[i].obj1 == 0 && all_infos[i].obj2 == 0)
> >> +        continue;
> >> +      if (FLAG_BOTH_ZERO (all_infos[i].flag))
> >> +          continue;
> >> +
> >> +      if (all_infos[i].obj1)
> >> +        filename = get_file_basename (all_infos[i].obj1->filename);
> >> +      else
> >> +        filename = get_file_basename (all_infos[i].obj2->filename);
> >> +
> >> +      if (overlap_func_level)
> >> +        printf("\n   processing %36s:\n", filename);
> >> +
> >> +      val = compute_one_gcov (all_infos[i].obj1, all_infos[i].obj2,
> >> +          overlap_sum_1, overlap_sum_2, &cum_1, &cum_2);
> >> +
> >> +      if (overlap_obj_level && (!overlap_hot_only || FLAG_ONE_HOT (all_infos[i].flag)))
> >> +        {
> >> +          printf("   obj=%36s  overlap = %6.2f%% (%5.2f%% %5.2f%%)\n",
> >> +                  filename, val*100, cum_1*100, cum_2*100);
> >> +          sum_val += val;
> >> +          sum_cum_1 += cum_1;
> >> +          sum_cum_2 += cum_2;
> >> +        }
> >> +
> >> +      prg_val += val;
> >> +
> >> +    }
> >> +
> >> +  if (overlap_obj_level)
> >> +    printf("   SUM:%36s  overlap = %6.2f%% (%5.2f%% %5.2f%%)\n",
> >> +           "", sum_val*100, sum_cum_1*100, sum_cum_2*100);
> >> +
> >> +  printf ("  Statistics:\n"
> >> +          "                    profile1_#     profile2_#       overlap_#\n");
> >> +  printf ("    gcda files:  %12u\t%12u\t%12u\n", gcda_files[0], gcda_files[1],
> >> +                                          gcda_files[0]-unique_gcda_files[0]);
> >> +  printf ("  unique files:  %12u\t%12u\n", unique_gcda_files[0],
> >> +                                        unique_gcda_files[1]);
> >> +  printf ("     hot files:  %12u\t%12u\t%12u\n", hot_gcda_files[0],
> >> +                                            hot_gcda_files[1], both_hot_cnt);
> >> +  printf ("    cold files:  %12u\t%12u\t%12u\n", cold_gcda_files[0],
> >> +                                            cold_gcda_files[1], both_cold_cnt);
> >> +  printf ("    zero files:  %12u\t%12u\t%12u\n", zero_gcda_files[0],
> >> +                                            zero_gcda_files[1], both_zero_cnt);
> >> +  printf ("       sum_all:  %12"PRId64"\t%12"PRId64"\n", p1_sum_all, p2_sum_all);
> >> +  printf ("       run_max:  %12"PRId64"\t%12"PRId64"\n", p1_run_max, p2_run_max);
> >> +
> >> +  return prg_val;
> >> +}
> >> +
> >> +/* Computer the overlap score of two lists of gcov_info objects PROFILE1 and PROFILE2.
> >> +   Return 0 on success: without mismatch. Reutrn 1 on error.  */
> >> +
> >> +int
> >> +gcov_profile_overlap (struct gcov_info *profile1, struct gcov_info *profile2)
> >> +{
> >> +  double result;
> >> +
> >> +  result = calculate_overlap (profile1, profile2);
> >> +
> >> +  if (result > 0)
> >> +    {
> >> +      printf("\nProgram level overlap result is %3.2f%%\n\n", result*100);
> >> +      return 0;
> >> +    }
> >> +  return 1;
> >> +}
> >> Index: libgcc/libgcov-driver.c
> >> ===================================================================
> >> --- libgcc/libgcov-driver.c   (revision 215981)
> >> +++ libgcc/libgcov-driver.c   (working copy)
> >> @@ -274,7 +274,10 @@ static struct gcov_summary_buffer *sum_buffer;
> >>     It computes and returns CRC32 and stored summary in THIS_PRG.
> >>     Also determines the longest filename length of the info files.  */
> >>
> >> -static gcov_unsigned_t
> >> +#if !IN_GCOV_TOOL
> >> +static
> >> +#endif
> >> +gcov_unsigned_t
> >>  compute_summary (struct gcov_info *list, struct gcov_summary *this_prg,
> >>                size_t *max_length)
> >>  {
> >> Index: gcc/doc/gcov-tool.texi
> >> ===================================================================
> >> --- gcc/doc/gcov-tool.texi    (revision 215981)
> >> +++ gcc/doc/gcov-tool.texi    (working copy)
> >> @@ -103,8 +103,7 @@ in these kind of counters.
> >>  @section Invoking @command{gcov-tool}
> >>
> >>  @smallexample
> >> -gcov-tool @r{[}@var{global-options}@r{]} SUB_COMMAND
> >> -@r{[}@var{sub_command-options}@r{]} @var{profile_dir}
> >> +gcov-tool @r{[}@var{global-options}@r{]} SUB_COMMAND @r{[}@var{sub_command-options}@r{]} @var{profile_dir}
> >>  @end smallexample
> >>
> >>  @command{gcov-tool} accepts the following options:
> >> @@ -123,6 +122,15 @@ gcov-tool rewrite [rewrite-options] @var{directory
> >>       [@option{-o}|@option{--output} @var{directory}]
> >>       [@option{-s}|@option{--scale} @var{float_or_simple-frac_value}]
> >>       [@option{-n}|@option{--normalize} @var{long_long_value}]
> >> +
> >> +gcov-tool overlap [overlap-options] @var{directory1} @var{directory2}
> >> +     [@option{-v}|@option{--verbose}]
> >> +     [@option{-h}|@option{--hotonly}]
> >> +     [@option{-f}|@option{--function}]
> >> +     [@option{-F}|@option{--fullname}]
> >> +     [@option{-o}|@option{--object}]
> >> +     [@option{-t}|@option{--hot_threshold}] @var{float}
> >> +
> >>  @c man end
> >>  @c man begin SEEALSO
> >>  gpl(7), gfdl(7), fsf-funding(7), gcc(1), gcov(1) and the Info entry for
> >> @@ -182,8 +190,42 @@ or simple fraction value form, such 1, 2, 2/3, and
> >>  @itemx --normalize <long_long_value>
> >>  Normalize the profile. The specified value is the max counter value
> >>  in the new profile.
> >> +@end table
> >>
> >> +@item overlap
> >> +Computer the overlap score between the two specified profile directories.
> >> +The overlap score is computed based on the arc profiles. It is defined as
> >> +the sum of min (p1_counter[i] / p1_sum_all, p2_counter[i] / p2_sum_all),
> >> +for all arc counter i, where p1_counter[i] and p2_counter[i] are two
> >> +matched counters and p1_sum_all and p2_sum_all are the sum of counter
> >> +values in profile 1 and profile 2, respectively.
> >> +
> >> +@table @gcctabopt
> >> +@item -v
> >> +@itemx --verbose
> >> +Set the verbose mode.
> >> +
> >> +@item -h
> >> +@itemx --hotonly
> >> +Only print info for hot objects/functions.
> >> +
> >> +@item -f
> >> +@itemx --function
> >> +Print function level overlap score.
> >> +
> >> +@item -F
> >> +@itemx --fullname
> >> +Print full gcda filename.
> >> +
> >> +@item -o
> >> +@itemx --object
> >> +Print object level overlap score.
> >> +
> >> +@item -t @var{float}
> >> +@itemx --hot_threshold <float>
> >> +Set the threshold for hot counter value.
> >>  @end table
> >> +
> >>  @end table
> >>
> >>  @c man end
> >


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]